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MULTIVALENT VACCINE FOR CLOSTRIDIUM BOTULINUM NEUROTOXIN 

FIELD OF THE INVENTION 

The present invention relates to the isolation of polypeptides derived from Clostridium 
hotulinum neurotoxins and the use thereof as immunoucns for the production of vaccines, 
including multivalent vaccines, and antitoxins. 

BACKGROUND OF THE INVENTION 

The uenus Clostridium is comprised of gram-positive, anaerobic, spore-forming bacilli. 
The natural habitat of these organisms is the environment and the intestinal tracts of humans 
and other animals. Indeed. Clostridia are ubiquitous: they are commonly found in soil. dust, 
sewage, marine sediments, decaying vegetation, and mud. [.Sec? eyi., P H. A. Sneath ct aL. 
"Clostridium" Bergeys Manual* of Systematic Bacteriology. Vol. 2. pp. 1141-1200. 
Williams & Wilkins (1986).| Despite the identification of approximately 100 species of 
C lostridium, only a small number have been recognized as etiolouic agents of medical and 
veterinary importance. Nonetheless, these species are associated with very serious diseases, 
including botulism, tetanus, anaerobic cellulitis, gas gangrene, bacteremia, pseudomembranous 
colitis, and clostridial gastroenteritis. Table I lists some of the species of medical and 
veterinary importance and the diseases with which they are associated. As virtually all of 
these species have been isolated from fecal samples of apparently healthy persons, some of 
these isolates may be transient, rather than permanent residents of the colonic flora. 

TABLE 1 



Ctaxtrkiium Species Of Medical And Veterinary Importance* 



Species 


Disease 


C *. umhunaicricum 


Bacteriuria (pregnant women \ 


C. argent inense 


Infected wounds: Bacteremia: Botulism: Infections of amniotic fluid 


C\ haratii 


Infected war wounds: Peritonitis: Infectious processes of the eye. ear 
and prostate 


C. hcifcrinckikii 


Infected wounds 


C. h i ferment am 


Infected wounds: Abscesses: Gas Gangrene: Bacteremia 


t\ hotiiiinuin 


Food poisoning: Botulism (wound, food, infant) 


C. hutyricum 


Urinary tract, lower respiratory tract, pleural cavity, and abdominal 
infections: Infected wounds: Abscesses: Bacteremia 


t\ vtuiaveris 


Abscesses: Infected wounds 
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TABLE I 

f Species Of Medical And Veterinary Importance* 

Disease 1 J 


\ C. L /U4ttY(H'i 

I (*. viustndiofontw 
| C c*H'hfcann/n 


Soft tissue infections: Bacteremia 1 
Blackleg 1 

^^Xr aL p,euraL a,,d °"~ infec,io " s: J 


1 ( dtffhitc 

I C A///uv 
J C* sihnoti 


_ Isolated from human disease processes, but role in disease unknown 

Annmicrobial-associa.^ diarrhea: Pseudomembranous enterocolitis 
Bacteremia: Pyogenic infections unv 

Soft tissue infections 

Sot! tissue infections I 


| (' s*!\ coin urn 
1 < fiit.\(if<trmc 

I € iff Vi*Mlt*i *f****rn 


Wound infections: Abscesses: Peritonitis I 
Infected war wounds: Bacteremia: Abscesses 


1 f />tf//j/ii' 


Infected war wounds: Gas gangrene: Ciineival plaque isolate 
Gastrointestinal tract infections 


• 1 l tfi't'l'nJfU'f 


Gastrointestinal tract infections: Empyema 

Penile lesions j 


1 < ' fc/VtlM 


Isolated from human disease processes, but ml* m h.w- linf - lllnvn 


| < tmtU*ntNnt$utttmi 


Bacteremia: Pentonitis: Pulmonary infections 
Various infectious processes 


1 mivvt 




. J C*. ttnttti ttm 


■Urinary tract infections: Rectal abscesses | 


(' />ittii/wf f tftciiftt 


Bacteremia: Peritonitis: Infected wounds: Appendicitis 




Gas gangrene: Anaerobic cellulitis; Intra-abdominal abscesses Sol, 
tissue infections, food poisoning: Necrotizing pneumonia: Lmpvcma 
Meningitis Bacteremia: Uterine Infections: Lnteritis necrotans I an* 
dysentery: Struck: Ovine Enicroioxemia- 


I < ' I'Utrcfacicnx 
[ ( '. putfifivum 


Bacterium (Pregnant women with bacteremia ) 
Abscesses: Infected wounds: Bacteremia 


I f latuasuni 


It) N>Pf trine tK„ ^1. J ' i 1 

Mtccons of the abdominal ci.vMy. yemial .met. lun*. ; ,„d bil.nrv m.« 
Bacteremia | 


| ( xarntiznfnmic 
1 f \ xcpticttm 


Isolated from htininn ftic**iv:*» rtr/wo, , i. . . ] 

""'"an aisca.se processes, but role m disease unknown 

Gas gangrene: Bacteremia: Suppurative infections: Necrotizing t 
enterocolitis: Braxy 


C xttrddfh 


Gas gangrene: Wound infectious: Penile lesions. Bacteremia 
Abscesses: Abdominal and vaginal infections 
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TABLE I 



Species 


Disease 


( *. sphetu uJcs 


Appendicitis; Bacteremia* Bone mul <nti tic<ui*» intwiinnc- 
Jntrapcriioneal infections: Infected war wounds: Visceral gas nangrene; 
Renal abscesses 




Gas gangrene; Bacteremia: Endocarditis: central nervous system and 
pleuropulmonary infections: Penile lesions: Infected war wounds: 
Other ovo* f enic infections 


( sithtcrmmalc 


Bacteremia; Empyema: Biliary tract, soft tissue and bone infections 


( " svmhiasum 


Liver abscesses: Bacteremia: Injections resulting due to bowel Mora 


i tcrtttwt 


Gas gangrene: Appendicitis: Brain abscesses: Intestinal tract and soft 
tissue infections: Infected war wounds: Periodontitis: Bacteremia 


< '. teta/ti 


Tetanus: Infected gums and teeth: Corneal ulcerations: Mastoid and 
middle ear infections: Intraperitoneal infections: Tetanus neonatorum: 
Postpartum uterine infections: Soft tissue infections, especially related 
lo trauma (including abrasions and lacerations): Infections related to 
use of contaminated needles 


( \ fhcrittosaccharohiictttn 


Isolated from human disease processes, but role in disease unknown. 
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l ompiled from P.G. Lngelkirk et ul "Ctasstfica/iotr. Principles and Practice nf Clinical Anucrohic 
tiacnnula^v. pp. 22-23. Star Publishing Co.. Belmont. CA ( 1992): J. Stephen and'K.A. Petrowski. 
"ftnins Mitch 7raverse Membranes and Deregulate Celts/' in liactcrutl Toxins. 2d ed pp. 66-67 
American Society for Microbiology < 1986): R. Berkow and A J. I I etcher (eds.l. Viactenal Diseases." 
Merck Manual nf Diastasis and Therapy. 16th ed.. pp. 1 16-126. Merck Research Laboratories. Railway. 
\.J. (1992): and O IL Sigmund and CM. f-raser (eds ). "Clostridial In/ectinns." Merck f eterinary 
Manual. 5th ed.. pp. 396-409. Merck & Co.. Rahwav. N.J. (1979) 
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In mosi cases, the pathogenicity of these organisms is related tt> the release ol powerful 
exotoxins or highly destructive enzymes. Indeed, several species of the genus ( lostrkiium 
produce toxins and other enzymes of great medical and veterinary significance. [C.L. 
Ilatheway. Clin. Microbiol. Rev. 3:66-98 (I990).| 

Perhaps because of their significance for human and veterinary medicine, much 
research has been conducted on these toxins, in particular those of C hom/imtm and C 
difficile. 



C hot u tin urn 

Several strains of Clostridium hoiuiimtm produce toxins of significance to human and 
animal health. |C.L. Haiheway. Clin. Microbiol. Rev. 3:66-98 (I990)| The effects of these 
toxins range from diarrheal diseases that can cause destruction of the colon, lo paralytic 
effects that can cause death. Particularly at risk for developing clostridial diseases are 
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neonates and humans and animals in poor health (e..t;.. those suffering from diseases 
associated with old age or immunodeficiency diseases). 

. Clostridium hotulimtm produces the most poisonous biological toxin known. The 

lethal human dose is a mere 10" mg/kg bodyweight for toxin in the bl«K>dstrcam. Botulinal 
loxin blocks nerve transmission to the muscles, resulting in llaccid paralysis. When the toxin 
reaches airway and respiratory muscles, it results in respiratory failure that can cause death. 
|S. Arnon. .1. Infect. Dis. 154:201-206 (I986)| 

T. hotulinum spores are carried by dust and are found on v egetables taken from the 
soil, on fresh fruits, and on agricultural products such as honey. ,1 inder conditions favorable 
to the organism, the spores germinate to vegetative cells which produces toxin. [S Arnon. 
Ann. Rev. Med. 31:541 (1980)| 

Botulism disease may be grouped into four types, based on the method of introduction 
of toxin into the bloodstream. Food-borne botulism results from ingesting improperlv 
preserved and inadequately heated food that contains botulinal loxin. [ here were 355 cases of 
food-borne botulism in the United States between 1976 and 1984. |K.I.. MacDonald ct at.. 
Am. J. Epidemiol. 124:794 ( 1986) | The death rate due to botulinal toxin is 12% and can be 
higher in particular risk groups. [CO. Tacket ct «/.. Am. .1. Med. 76:794 < I9X4>.| Wound- 
induced botulism results Iron, C. hotulinum penetrating traumatized tissue and producing toxin 
that is absorbed into the bloodstream. Since 1950. thirty cases of wound botulism have been 
reported. |M.N. SwarlZ. "Anaerobic Sporc-l-ormm.v Bacilli: The Clostridia." PP . 633-646. ,n 
li.O. Davis ci «/..(eds.). Microbiology. 4il, edition. .1.13. I.ippmcoti C o. (I99<».| Inhalation 
botulism results when the loxin is inhaled. Inhalation botulism has been reported as the result 
of accidental exposure in the laboratory |H. Ilolzer. Med. Klin. 41:1735 (1962)) and could 
arise if the toxin is used as an agent of biological warfare 1 1). It. Franz ct a/., in Botulinum 
15 and Tetanus Xcurotoxms. 13. R. DasGupta. ed.. Plenum Press. New York ( 1993). pp. 473-476]. 
Infectious infant botulism results from C \ hotulinum colonization of the infant intestine with 
production of toxin and its absorption into the bloodstream. It is likely thai the bacterium 
gains entry when spores are ingested and subsequently germinate. |S. Arnon. .1. Infect. Dis. 
154:201 (1 986). | There have been 500 cases reported since it was first recognized in 1976. 
; 0 | M.N. Swartz. .supra.] 
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Infant botulism strikes infants who are three weeks to eleven months old (greater than 
90% of the cases are infants less than six months). |S. Arnon. J. Infect. Dis. 154:201 (1986).| 
, It is believed that infants are susceptible, due. in large part, to the absence of the full adult 
complement of intestinal microflora. The benign microflora present in the adult intestine 
provide an acidic environment that is not favorable to colonization by C. hoiulhwm. Infants 
begin life with a sterile intestine which is gradually colonized by microflora. Because of the 
limited microflora present in early infancy, the intestinal environment is not as acidic, 
allowing for C. hmulinum spore germination, growth, and toxin production. In this regard, 
some adults who have undergone antibiotic therapy which alters intestinal microflora become 
more susceptible to botulism. 

An additional factor accounting for infant susceptibility to infectious botulism is the 
immaturity of the infant immune system. The mature immune system is sensitized to 
bacterial antigens and produces protective antibodies. Secretory IgA produced in the adult 
intestine has the ability, to agglutinate vegetative cells of C. hoiulhmm. |S. Anion. J. Infect 
Dis. 154:201 ( |?86).| Secretory IgA may also act by preventing intestinal bacteria and their 
products from crossing the cells of the intestine. [S. Anion. Epidemiol. Rev. 3:45 (1981 ).| 
The infant immune system is not primed to do this. 

C linical .symptoms of infant botulism range from mild paralysis, to moderate and 
severe paralysis requiring hospitalization, to fulminant paralysis, leading to sudden death. fS. 
Arnon. Upidemiol. Rev. 3:45 (1981).] 

The chief therapy for severe infant botulism is ventilatory assistance using a 
mechanical respirator and concurrent elimination of toxin and bacteria using cathartics, 
enemas, and gastric lavage. There were 68 hospitalizations in California for infant botulism 
in a single year with a total cost of over $4 million for treatment. (T.L. Frankovich and S. 
Arnon. West. J. Med. 154.103 <199n.| 

Different strains of Clostridium hmulinum each produce antigenically distinct toxin 
designated by the letters A-G. Serotype A toxin has been implicated in 26% of the cases of 
food botulism: types B. E and I have also been implicated in a smaller percentage of the food 
botulism cases |ll. Sugiyama. Microbiol. Rev. 44:419 ( 1980)]. Wound botulism has been 
reportedly caused by only types A or B toxins [H. Sugiyama. sup,u\. Nearly all cases of 
infant botulism have been caused by bacteria producing either type A or type B toxin. 
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< Exceptionally, one New Mexico case was caused by Clostridium how/mum producing type F 
toxin and another by Clostridium hoiulinum producing a type B-type F hybrid.) fS. Arnon. 
Epidemiol. Rev. 3:45 (I98l).| Type C toxin affects waterfowl; cattle, horses and mink. Type 
O toxin affects cattle, and type E toxin affects both humans and birds. 

A trivalent antitoxin derived from horse plasma is commercially available from 
Connaught Industries Ltd. as a therapy for toxin types A. B. and I I lowcvcr. the antitoxin 
has several disadvantages. First, extremely large dosages must be injected intravenously 
and/or intramuscularly. Second, the antitoxin has serious side effects such as acute 
anaphylaxis which can lead to death, and serum sickness. Finally, the efficacy of the 
antitoxin is uncertain and the treatment is costly |C.O. Tucket ei at.. Am. J. Med. 76 794 
(I984).| 

A heptavalent equine botulinal antitoxin which uses only the F(ab )2 portion of the 
antibody molecule has been tested by the United States Military. |M. Halady. USAMRDC 
Newsletter. P .6<I99|>.| This was raised against impure toxoids in those- large animals and is 
not a high titer preparation. 

A pcntavalent human antitoxin has been collected from immunized human subjects for 
use as a treatment tor infant botulism. The supply of this antitoxin is limited and cannot be 
expected lo meet the needs of all individuals stricken with botulism disease. In addition, 
collection of human sera must involve screening out HIV and other potentially serious human 
pathogens. |I\J. Schwarz and S.S. Arnon. Western J. Med. 156:197 f 1 992). J 

Infant botulism has been implicated as the cause of mortality in some cases of Sudden 
Infant Death Syndrome (SIDS. also known as crib death). SIDS is officially recognized as 
infant death that is sudden and unexpected and that remained unexplained despite complete' 
post-mortem examination; The link of SIDS to infant botulism came when fecal or blood 
specimens taken at autopsy from SIDS infants were round to contain < \ hoiulinum organisms 
and/or toxin in 3-4% of cases analyzed. (D.R. Peterson et ai. Rev. Infect. Dis. 1:630 
(1 97°). J In contrast, only I of 160 healthy infants (0.6%) had C. haiulmum organisms in the 
feces and no botulinal toxin. (S. Arnon ct «/. .-Lancet; pp. 1273-76. June 17. (978.) 

In developed countries. SIDS is the number one cause of death in children between 
one month and one year old. (S. Arnon et ai. Lancet, pp. 1273-77. June 17. 1978.) More 
children die from SIDS in the first year than from any other single cause o I death in the first 
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fourteen years of! ife. In the United States, there are 8.000-10.000 SIDS victims annually. 
Id 

What is needed is. an effective therapy against infant botulism that is free of dangerous 
side effects, is available in large supply at a reasonable price, and can 'be safely and gently 
delivered so that prophylactic application to infants is feasible. 

Immunization of subjects with toxin preparations has been done in an attempt to 
induce immunity against botulinal toxins. A C botidimtm vaccine comprising chemically 
inactivated U U\. formaldehyde-treated) type A. B. C. D and E toxin is commercially available 
lor human usage. However, this vaccine preparation has several disadvantages. First, the 
eftlcaey of this vaccine is variable (in particular, only 78% of recipients produce protective 
levels of anti-type 13 antibodies following administration of the primary series). Second, 
immunization is painful (deep subcutaneous inoculation is required for administration), with 
adverse reactions being common (moderate to severe local reactions occur in approximately 
6% of recipients upon initial injection; this number rises to approximately I 1% of individuals 
who receive booster injections) [Informational Brochure for the Pentavatent (ABCDF) 
Botulinum Toxoid. Centers for Disease Control). Third, preparation of the vaccine is 
dangerous as active toxin must be handled by laboratory workers. 

What is needed are safe and effective vaccine preparations for administration to those 
at risk of exposure to ( *. hoiidinum toxins. 

C\ difficile 

C. difficile, an organism which gained its name due to difficulties encountered in its 
isolation, lias recently been proven to be an etiologic agent of diarrheal disease. {Nnealh et 
cd.. p. I 165 J.. .('. difficile is present in the gastrointestinal tract of approximately 3% of 
healthy adults, and 10-30% of neonates without adverse effect (Svvartz. at p. 644): by other 
estimates. C. difficile is a part of the normal gastrointestinal flora of 2-10% of humans. JG.F. 
Brooks et <//.. (eds.) "Infections Caused by Anaerobic Bacteria" ./aire/:. Metnick. tv 
Adalbert; s Medical Microbiology. 19th cd.. pp. 257-262. Applelon & Lange. San Mateo. CA 
(IWI).| As these organisms are relatively resistant to most commonly used antimicrobials, 
when a patient is treated with antibiotics, the other members of the normal gastrointestinal 
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flora are suppressed and C. difficile flourishes, producing cytopathic ioxins and enterotoxins. 
It has been found in 25% of cases of moderate diarrhea resulting from treatment with 
antibiotics, especially the cephalosporins, clindamycin, and ampicillin.l IM.N. Svvartz at 644.] 

Importantly. C. difficile is commonly associated with nosocomial infections. The 
organism is often present in the hospital and nursing home environments and may be carried 
on the hands and clothing of hospital personnel who care for debilitated and 
immunocompromised patients. As many of these patients arc being treated with 
antimicrobials or other chemotherapeulie agents, such transmission of C. difficile represents a 
significant risk factor for disease. (Engelkirk ei at., pp. 64-67.) 

C. difficile is associated with a range of diarrhetic illness, ranging from diarrhea alone 
to marked diarrhea and necrosis of the gastrointestinal mucosa with the accumulation of 
inflammatory cells and fibrin, which forms a pseudomcmbrane in the affected area. (Urooks 
a a/. > It has been found in over 95% of pseudomembranous enterocolitis cases. (Swartz. at 
p. 644., This occasionally fatal disease is characterized by diarrhea, multiple small colonic 
plaques, and toxic megacolon, (Svvartz. at p. 644 ) Although stool cultures are sometimes 
used for diagnosis, diagnosis is best made by detection of the heat labile toxins present in 
fecal nitrates from patients with enterocolitis due to C. difficile. (Swartz. at p. 644-645; and 
Brooks ei at.,.* p. 260.) C. difficile toxins are cytotoxic for tissue/cell cultures and cause 
enterocolitis when injected intracccally into hamsters. (Swartz. at p. 644.) 

The enterotoxicity of C. difficile is primarily due to the action of two toxins, 
designated A and B. each of approximately 300.000 in molecular weight Both are potent 
cytotoxins. with toxin A possessing direct cnterocytotoxic activity. [I.vcrlv ei at.. Infect 
Immun. 60:4633 (I992).| Unlike toxin A of (.'. perfrinyen,, an organism rarely associated 
with antimicrobial-associated diarrhea, the toxin of C. difficile is not a spore coat constituent 
25 and is not produced during spoliation. (Swartz. at p. 644.) ( V difficile toxin A causes 
hemorrhage, lluid accumulation and mucosal damage in rabbit ileal loops and appears to 
increase the uptake of toxin B by the intestinal mucosa. Toxin B does no, cause intestinal 
lluid accumulation, but it is 1000 times more tox.c than toxin A to tissue culture cells and 
causes membrane damage. Although both toxins induce similar cellular effects such as actin 
30 disaggregation, differences in cell specificity occurs. 
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Both toxins are important in disease. fBorriello et at. Rev. Infect. Dis., I2(suppl 
2):SI8S (1990). Lycrly et .at.. Infect. Immun.. 47:349 (1985): and Rolfe. Infect. Immun.. 
59:1223 (1990). | Toxin A is thought to act first by binding to brush'bordcr receptors, 
destroying the outer mucosal layer." then allowing toxin B to gain access to the underlying 
:» tissue. These steps in pathogenesis would indicate that the production of neuiraliziim 

antibodies against toxin A may be sufficient in the prophylactic therapy of CDAD. However, 
antibodies against toxin B may be a necessary additional component for, an effective 
therapeutic against later stage colonic disease. Indeed, it has been reported that. animals 
require antibodies to both toxin A and toxin B to be completely protected against the disease. 
10 | Kim and Rolfe. Abstr. Ann. Meet. Am. Soc. Microbiol.. 69:62 ( 1 987) J 

( difficile has also been reported to produce other toxins such as an enteroioxin 
different from toxins A and B [Banno cv at . Rev. Infect. Dis.. 6(Suppl. 1:SI I-S20 (1984)|. a 
low molecular weight "toxin |Rihn ef at. Biochem. Biophys. Res. C omm.. 124:690-695 
\ I984)|. a motility altering factor [Justus a at. Gastroenterol.. 83:8.16-843 (I982)|. and 
15 perhaps other toxins. Regardless. C. difficile gastrointestinal disease is of primary concern. 

It is significant that due to its resistance to most commonly used antimicrobials. ( '. 
difficile is associated with antimicrobial therapy with virtually all antimicrobial agents 
(although most commonly ampicillin. clindamycin and cephalosporins). It is also associated 
with disease in patients undergoing chemotherapy with such compounds as methotrexate. 5- 
20 lluorouracil. cyclophosphamide, and doxorubicin. |S.M. Finegold ef at. ( 'finical Guide u> 
Anuerohic Infections, pp. 88-89. Star Publishing Co.. Belmont. CA (1992).| 

treatment of T difficile disease is problematic, given the high resistance of the 
organism. Oral metronidazole, bacitracin and vancomycin have been reported to be effective, 
(f inegold ef at. p. 89.) However there are problems associated with treatment utilizing these 
!:> compounds. Vancomycin is very expensive, some patients are unable to take oral medication, 
and the relapse rale is high (20-25%). although it may not occur for several weeks. /</. 

( difficile disease would be prevented or treated by neutralizing the effects of these 
toxins in the gastrointestinal tract. Thus, what is needed is an effective therapy against (\ 
difficile toxin that is free of dangerous side effects, is available in large supply at a reasonable 
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price, and can be safely delivered so .thai, prophylactic application to patients at risk of 
developing pseudoincmbranous enterocolitis can be effectively treated. 

DESCRIPTION OF THE l)RAWFNC;S 

Figure I shows the reactivity of anti-C hoiulinum IgY by Western blot, 
figure 2 slums the IgY antibody titer to ( /„„„//„„„, type A toxoid in e glIs . measured 
by l-I. ISA. 

figure 3 shows the results off. difficile toxin A neutralization assavs. 

Figure 4 shows the results ol C. difficile toxin B neutral izaiion assays. 

figure 5 shows the results of C difficile toxin » neutralization assays. 

figure 6 is a restriction map of C. difficile toxin A gene, showing sequences of 
primers 1-4 (SFO II) NOS: 1-4). 

f igure 7 is a Western blot of ( difficile toxin A reactive protein. 

f igure K shows < . difficile loxin A expression constructs. 

figure 9 shows ( \ difficile toxin A expression constructs. 

f igure 10. shows the purification of recombinant C difficile toxin A. 
f igure I I shows the results of <;. difficile toxin A neutralization assays w,h antibodies 
reactive to recombinant toxin A. 

f igure 12 shows the results for a C. difficile toxin A neutralization plate. 

f igure 13 shows the results for a C. difficile toxin A neutral ization plate. 

figure 14 shows the results of recombinant ( difficile toxin A neutralization assavs. 

Figure 15 shows ( difficile toxin A expression constructs. 

l igure 16 shows a chroma.ograph plotting absorbance at 280 nm against retention time 
for a p.VIA 1X70-6X0 IgY PFCi preparation. 

Figure 1 7 shows two recombinant ( difficile toxin » expression constructs, 
f igure 18 shows C. difficile toxin B expression constructs, 
f igure l<> shows C. difficile toxin U expression constructs, 
figure 20 shows C. difficile toxin U expression constructs. 

figure 21 is an SDS-IVUiF gel showing lhtf purification ol recombinant C \ difficile 
toxin it fusion proiein. 
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Figure 22 is an SDS-PAGE gel showing the purification of two histidine-iagged 
recombinant C difficile xoxm B proteins. 

Figure 2.3 shows C\ difficile toxin B expression constructs. 

Figure 24 is a Western blot of C. difficile toxin B reactive protein. 

Figure 25 shows (* hotulinum type A toxin expression constructs: constructs used to 
provide ( ' hofiiiimtm or C \ difficile sequences are also shown. 

Figure 26 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 
of recombinant ( * hoiultnum type A toxin fusion proteins. 

Figure 27 shows C. hotulimim type A toxin expression constructs; constructs used to 
provide ( '. Inuulinum sequences are also shown. 

Figure 28 is an SOS-PAGE gel stained with Coomaisse blue showing the purification 
of pliisBot protein using the Ni-NTA resin. 

figure 2 C > is an SDS-PAGE gel stained with Coomaisse blue showing the expression of 
pliisBot protein in B1.2KDE3) and BL2 l(DE3)pLysS host cells. 

f igure 30 is an SDS-PAGF. gel stained with Coomaisse blue showing the purification 
of pliisBot protein using a batch absorption procedure. 

Figure 31 is an SDS-PAGF gel stained with Coomaisse blue showing the purification 
of pliisBot and pHisBot(nativc) proteins using a Ni-NTA column. 

Figure 32 is an SDS-PAGF. gel stained with Coomaisse blue showing the purification 
of pllisBolA protein expressed in pi lisBolA(syn) kan laclq T7/pACYCGro/BI.2 UDE3) cells 
using an IDA column. 

figure 33 is an SDS-PAGF~ gel stained with Coomaisse blue showing the purification 
of pliisBoiA. pHisBolB and pIlisBotF proteins by IDA chromatography followed by 
chromatography on S-100 to remove folding chape rones. 

Figure 34 is an SDS-PAGF gel stained with Coomaisse blue showing the extracts 
derived from pllisBotB amp T7lac/BL2 UDE3) cells before and after purification on a Ni- 
NTA column. 

Figure 35 is an SDS-PAGE gel run under native conditions and stained with 
Coomaisse blue showing the removal of folding chaperones from IDA-purified BotB protein 
using a S-i(K) column. 



W ° 98/08540 I PCT/US97/15394 

Figure 36 is an SOS-PAGO gel stained with Coomaisse blue showing proteins that 
eluted durmg an imidazo , e step gradiem appHed (<) a UM ^ n ^ ^ ^ 

pHisBotB kan laclq T7/pACYCGro/BL2l(DE3) cells | 

Figure 37 is an SDS-PAGE gel run under native conditions and stained with 

C oomaisse blue showing IDA-purifled BolB protein before and alter ultrafiltration 

Figure 38 is an SDS-PAGE gel stained with Coomaisse blue showing the purification 

»»l Uotfc- protein using a NiNTA column. 

Figure 39 is an SDS-PAGE gel stained with Coomaisse blue show in, extracts derived 
.rom pliisBotA kan T7 lac/BL2«(DE3) pLysS cells grown in fermentation culture. 

Figure 40 is a chromatogram showing protems present after 1DA- P urif,ed Botl- protein 
was applied to a S- 100 column. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below 
As used herein, the term "neutralizing" is used in reference to antitoxins, particular* 
anntoxms comprising antibodies, which have the ability to prevent the patho.ouica. actions of 
the toxin against which the antitoxin is directed. 

As used herein, the term "overproducing" is used in reference to the production of 
clostridial toxin polypeptides in a host cell and indicates tha, the host cel. is produci,,- more 
«>« «he clostridial toxin by virtue ol the introduction of nucleic acid sequences encoding said 
clostnd.a. toxin polypeptide than would be expressed by said host cel. absent the introduction 
o« sa,d nue.ee acid sequences. To allow ease of purification of toxin po.vpepudes produced 
m a host cell ,t is preferred .ha, the host cell express or overproduce said toxin P olv P e P ,ide at 
a level greater than I mg/liter of host cell culture. 

"A host cell capable of expressing a recombinant protein at a level ureater than or 
equal to 5% of the total cellular protein" is a host cel. in which the recombmant protein 
represents at least 5% of the total cellular protein. To determine vvhu, percentaue of total 
cellular protein the recombinant protem represents, the following steps are taken. A total of 
10 ° D ll " ilS ? r "^inam host cells (,..<.. 200 M | of cells at ()D M „ 50/mh are removed 
«n« a umepoin, known to represent the peak of expression of the desired recombinant protem, 
io a I > ml m.crofuge tube and pelleted for 2 min a, maximum rpm in a microbe. The 
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pellets are resuspended in I ml of 50 mM NaHP0 4 . 0.5 M NaCI 40 mM imidazole buffer 
(pH 6.8) containing I mg/rril lysozyime. The samples are incubated for 20 min at room 
temperature and stored ON at -70°C. Samples are thawed completely at room temperature 
and sonicated 2X10 seconds with a Branson Sonifier 450 microtip probe a( # 3 power 
setting. The samples are centrifuged for 5 min. at maximum rpm in a microfuge An aliquot 
(20 j.il) of the protein sample is removed to 20 pi 2X sample buffer (this represents the lota! 
protein extract). The samples are heated to 95°C for 5 min. then coolc,d and 5 or 10 ja! are 
loaded onto 12.5% SDS-PAGE gels. High molecular weight protein markers arc also loaded 
lo allow for estimation ol the MW of identified recombinant proteins. After electrophoresis, 
protein is detected generally by staining with Coomassie blue and the stained gel is scanned 
using a densitometer to determine the percentage of protein present in each band. In this 
manner, the percentage of protein present in the band corresponding to the recombinant 
protein ol' interest may be determined. It is not necessary that C oomassie blue be employed 
Cor the detection of protein, a number of fluorescent dyes ft'.,if.. Sypro orange S-665'1 
l> i Molecular I'robes. Eugene. OR| may be employed and the stained gel scanned using a 
lluoroimager jt' .e;.. f luor Imager SI (Molecular Dynamics. Sunnyvale. CA)|. 

"A host cell capable of expressing a recombinant protein as a soluble protein at a level 
greater than or equal to 0.25% of the total soluble cellular protein" is a host cell in which the 
amount of soluble recombinant protein present represents at least 0.25% of the total cellular 
20 protein. As used herein "total soluble cellular protein" refers to a clarified PEI lysate 

prepared as described in Example 3l(c)(iv). Briefly, cells are harvested following induction 
ol expression of recombinant protein < at a point of maximal expression*. The cells are 
resuspended in cell resuspension buffer (CRB: 50 mM NaPQ 4 . 0.5 M NaCI. 40 mM 
imidazole, pi I 6.8) to create a 20% cell suspension (wet weight of cells/volume of CRB)' and 
2? cell lysates are prepared as described in Example 3 l(c)(iv) </.c. sonication or homogenization 
followed by centrifugation). The cell lysate is then flocculated utilizing polyethyleneimine 
(MEI) prior to centrifugation. PEI (a 2% solution in drTO. pH 7.5 with IICI) is added to the 
cell lysate to a final concentration of 0.2%. and stirred for 20 min at room temperature prior 
to centrifugation |8.500 rpm in JAI0 rotor (Beckman) Tor 30 minutes at 4°C|. This treatment 
30 removes KNA. DNA and cell wall components, resulting in a clarified, low viscosity lysate 
("PHI clarified lysate"). The recombinant protein present in the PEI clarified lysate is then 
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purified «,.<.. by chromatography on an IDA column for his-tagged proteins,. The amount of 
purified recombinant protein Uc the eluted protein) is divided hy the concentration of 
protein present in the |»|£I clarified lysatc (typically 8 mg/ml when using a 20% cell 
suspension as the starting material) and multiplied by |()() to determine Avhut percentaue of 
total soluble cellular protein is comprised of the soluble recombinant protein (see lixample 



b). 



As used herein, the term "fusion protein" refers to a -chimeric protein containing the 
protein of interest (/.,.. C /,„/„//„„,„ loxin A . B. C. D. E. f . or Ci and fragments thereof, 
joined to an exogenous protein fragment (the fusion partner which consists of a non-toxin 
protein). The fusion partner may enhance solubility of the ( '. hmutinum protein as expressed 
in a host cell, may provide an affinity tag to allow purification of the recombinant fusion 
protein Iron, the host cell or culture supernatant, or both. If desired, the fusion protein mav 
be removed from the protein of interest (i.e.. toxin protein or fragments thereof prior to 
immunization by a variety of enzymatic or chemical means known to the art. 

As used herein the term "non-toxin protein" or "non-toxin protein sequence" refers to 
that portion of a fusion protein which comprises a protein or protein sequence w hich is not 
derived from a bacterial toxin protein. 

The term "protein of interest" as used herein refers to the protein whose expression is 
desired within the fusion protein. In a fusion protein the protein of interest will be joined or 
fused with another protein or protein domain, the fusion partner, to allow for enhanced 
stability of the protein of interest and/or ease of purification of the fusion protein. 

As used herein, the term "maltose binding protein" refers to the maltose binding 
protein of /.. voli. A portion of the maltose binding protein may be added to a protein of 
interest to generate a fusion protein: a portion of the maltose binding protein may merely 
enhance the solubility of the resulting fusion protein when expressed in a bacterial host. On 
the other hand, a portion of the maltose binding protein may allow affinity purification of the 
fusion protein on an amylose resin. 

As used herein, the term "poly-histidine tract" when used in reference to a fusion 
protein refers to the presence of two to ten histidine residues at either the amino- or carboxy- 
terminus of a protein of interest. A poly-histidine tract of six to ten residues is preferred. 
The poly-histidine tract is also defined functionally as being a number of consecutive histidine 



- 14 - 



WO 98/08540 PCT/US97/15394 

residues added to the protein of interest which allows the affinity purification of the resulting 
fusion, protein on a nickeUchelaic or IDA column. 

As used herein, the term "purified" or "to purify" refers to the! removal of 
contaminants from a sample. I or example, antitoxins are purified by removal of 
contaminating non-immunoglobultn proteins; they are also purified by the removal of 
immunoglobulin that does not bind toxin. The removal of non-immunoglobulin proteins 
;ind/or the removal of immunoglobulins that do not bind toxin results in an increase in the 
percent of toxin-reactive immunoglobulins in the sample. In another example, recombinant 
toxin polypeptides are expressed in bacterial host cells and the toxin polypeptides arc purified 
by the removal of host cell proteins: the percent of recombinant toxin polypeptides is thereby 
increased in the sample. Additionally, the recombinant toxin polypeptides arc purified by the 
removal -of host cell components such as lipopolysaccharidc ic.^.. endotoxin) 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
which is comprised of segments of DNA joined together by means of molecular biological 
techniques. 

I he term "recombinant protein" or "recombinant polypeptide" as used herein refers to 
a protein molecule which is expressed from a recombinant DNA molecule. 

[We term "native protein" as used herein refers to a protein which is isolated from a 
natural source as opposed to the production of a protein by recombinant means. 

As used herein the lerm "portion" when in reference to a protein (as in "a portion of a 
given protein") refers to fragments of that protein. The fragments may range in size from 
lour amino acid residues to the entire amino acid sequence minus one amino acid. 

As used herein "soluble" when in reference to a protein produced by recombinant 
DNA technology in a host cell is a protein which exists in solution in the cytoplasm of the 
host cell; if the protein contains a signal sequence the soluble protein is exported to the 
periplasms space in bacteria hosts and is secreted into the culture medium in eucaryotic cells 
capable o| secretion or by bacterial host possessing the appropriate genes (i.e.. the kit uenej. 
In contrast, an insoluble protein is one which exists in denatured form inside cytoplasmic 
granules (called inclusion bodies) in the host cell. High level expression (U>.. greater than 10- 
20 mg recombinant protein/liter of bacterial culture) of recombinant proteins often results in 
the expressed protein being found in inclusion bodies in the bacterial host cells. A soluble 
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protein is a protein which is not found in an inclusion body inside the host cell or is found 
both in the cytoplasm and in inclusion bodies and in this case the protein may be present at 
high or low levels in the cytoplasm. 

A distinction is drawn between a soluble protein {i.e.. a protein which when expressed 
in a host cell is produced in a soluble form) and a "solubilized" protein. An insoluble 
recombinant protein found inside an inclusion body may be solubilized (i.e.. rendered into a 
soluble form) by treating purified inclusion bodies with denaturants such' as guanidine 
hydrochloride, urea or sodium dodecyl sulfate (SDS). These denaturants must then be 
removed from the solubilized protein preparation to allow the recovered protein to rcnature 
(refold). Not all proteins will refold into an active conformation after solubilization in a 
denaturant and removal of the denaturant. Many proteins precipitate upon removal of the 
denaturant. SDS may be used to soiubilizc inclusion bodies and will maintain the proteins in 
solution at low concentration. However, dialysis will not always remove all of the SDS (SDS 
can form micelles which do not dialyze out): therefore. SDS-solubilizcd inclusion body 
protein is soluble but not refolded. 

A- distinction is drawn between proteins which are soluble ( i e.. dissolved) in a 
solution devoid of significant amounts of ionic detergents SDS) or denaturants u* 

urea, guanidine hydrochloride) and proteins which exist as a suspension of insoluble protein 
molecules dispersed within the solution. A soluble protein will not be removed from a 
solution containing the protein by centrifugation using conditions sufficient to remove bacteria 
present in a liquid medium {i.e.. centrifugation at 12.000 \ g for 4-5 -minutes). For example* 
to test whether two proteins, protein A and protein B. are soluble in solution, the two proteins 
are placed into a solution selected from the group consisting of PBS-NaCI (PBS containing 
0.5 M NaCi). PBS-NaCI containing 0.2% Tween 20. PBS. PBS containing 0.2% Tween 20. 
PBS-C (PBS containing 2 mM CaCF). PBS-C containing either 0.1 or 0.5 % Tween 20. PBS- 
C containing either 0.1 or 0.5% NP-40. PBS-C containing either 0.1 or 0.5% Triton X- 1 00. 
PBS-C containing 0.1% sodium deoxycholate. The mixture containing proteins A and B is 
then centrifuged at 5000 x g for 5 minutes. The supernatant and pellet lormed by 
centrifugation are then assayed for the presence of protein A and B. If protein A is found in 
the supernatant and not in the pellet (except for minor amounts {i.e.. less than 10%) as a 
result of trapping!, protein is said to be soluble in the solution tested. If the majority of 
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protein B is found in the pellet {La.: greater than 90%). then protein B is said to exist as a 
suspension in the solution tested. 

As used herein, the term "therapeutic amount" refers to that amount of antitoxin 
required to neutralize the pathologic effects of one or more clostridial toxins in a subject. 

The term "pyrogen" as used herein refers to a fever-producing substance. Pyrogens 
may be endogenous to the host (e.g.. prostaglandins) or may be exogenous compounds (e 
bacterial endo- and exotoxins, nonbacterial compounds such as antigens and certain steroid 
compounds, etc.). "The presence of pyrogen in a pharmaceutical solution may be delected 
using the U.S.' Pharmacopeia (USP) rabbit fever test (United States Pharmacopeia. Vol. XXII 
(1900) United States Pharmacopeial Convention. Rockvillc. MD. p. 151). 

The term "endotoxin" as used herein refers to the high molecular weight complexes 
associated with the outer membrane of gram-negative bacteria. Uiipurifted endotoxin contains 
lipids, proteins and carbohydrates. Highly purified endotoxin does not contain protein and is 
referred to as lipopoiysaccharide (LPS). Because unpurifled endotoxin is of concern in the 
production -of pharmaceutical compounds (c.^.. proteins produced in E. volt using recombinant 
DNA technology), the term endotoxin as used herein refers to unpurificd endotoxin. Bacterial 
endotoxin is a well known pyrogen. 

As used herein, the term "endotoxin-free" when used in reference to a composition to 
be administered parenteral ly (with the exception of intrathecal administration) to a host means 
that the dose to be delivered contains less than 5 FT l/kg body weight [PDA Guidelines for 
Parenteral Drugs (December I987)|. Assuming a weight of 70 kg for an adult human, the 
dose must contain less than 350 KU to meet FDA Guidelines for parenteral administration. 
[Endotoxin levels are measured herein using the Umuius Amebocyte Lysate (LAL) lest 
(Limulus Amebocyte Lysate Pyrochrome' M . Associates of Cape Cod. Inc. Woods Hole. MA). 
To measure endotoxin levels in preparations of recombinant proteins. 0.5 ml of a solution 
comprising 0.5 mg of purified recombinant protein in 50 mM NaP() 4 . pH 7.0. 0.3M NaCI and 
10% glycerol is used in the LAL assay according to the manufacturer's instructions for the 
endpoint chromogenic without diazo-coupling method [the specific components of the buffer 
containing recombinant protein to be analyzed in the LAL test are not important: any buffer 
having a neutral pH may be employed (see for example, alternative buffers employed in 
Examples 34. 40 and 45 )|. Compositions containing less than or equal to than 250 endotoxin 
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UK- «crn, -monovalent" when used in reference to a clostridial vaccine refers to , 
vaccne which is capaNe ofprovo.in, an inunune response in a host ani ma , directed aJainst a 
s.n,lc tvpe of Costridia, toxin. For example, if imman^uon of a host w ith < ; ht „ lttlmm , ' 
t>pe A ,<,x,n vaccine induces ant.bodies in the immunized host which protect gainst , 
challenge w„h type A toxin but no. against challenge with tvpe H. C. D F I- or C, toxins 
then the type A vaccine is said to be monovalent. In contrast, a "■nuhivalen," vaccne 
Provokes an immune response in a host animal directed against several more than one, 
c.«str,d,a, toxins. l or example, if immunization of a host with a vaccine comprising C ' 
h0,U ' hU,m ,> PC A ^ B lOXi " S Pnxluction of antibodies which protect the host 

ngatnst a cha.lenge with both type A and R toxin, the vaccine is said to he multivalent ,in 
particular. th, s hypothetical vaccine is bivalent). 
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As used herein the term "immunogenically-effective amount" refers to that amount of 
an immunogen required to invoke the production of protective levels of antibodies in a host 
upon vaccination. t 

The -term "protective level", when used in reference to the level of antibodies induced 
upon immunization of the host with an immunogen which comprises a bacterial toxin, means 
a level of circulating antibodies sufficient to protect the host from challenge with a lethal dose 
of the toxin. 

As used herein the terms "protein" and "polypeptide" refer to compounds comprisinti 
amino acids joined via peptide bonds and are used interchangeably. 

The terms "toxin" and "neurotoxin" when used in reference to toxins produced bv 
members {i.e.. species and strains) of the genus Clostridium are used interchangeably and 
refer lo the proteins which arc poisonous to nerve tissue. 

The term "receptor-binding domain" when used in reference to a (* hotulinum toxin 
refers to the carboxy-terminal portion of the heavy chain (ll t or the C fragment) of the toxin 
which is presumed to be responsible for the binding of the active toxin (i.e.. the derivative 
toxin comprising the H and L chains joined via disulfide bonds) to receptors on the surface of 
synaptosomcs. The receptor-binding domain for C hatulimtm type A toxin is defined herein 
as comprising amino acid residues 861 through 1296 of SEQ II) NO:28. I lie receptor- 
binding domain for ( hotulinum type B toxin is defined herein as comprising amino acid 
residues 848 through 1291 of SEQ ID NO:40 (strain Eklund 1711). The receptor-binding 
domain of ( ' hotulinum type CI toxin is defined herein as comprising amino acid residues 
856 through 1291 of SEQ ID NO:60. I hc receptor-binding domain of C hotulinum type D 
toxin is defined herein as comprising amino acid residues 852 through 1276 of SEQ ID 
NO:66. The receptor-binding domain of C\ hotulinum type E toxin is defined herein as 
comprising amino acid residues 835 through 1250 of SEQ ID NO:50 (Beluga strain). The 
receptor-binding domain of C. hotulinum type F toxin is defined herein as comprising amino 
acid residues 853 through 1274 of SEQ ID NO:7I. The receptor-binding domain of (* 
hutulinum type (i toxin is defined herein as comprising amino acid residues 853 throuuh 1297 
of SEQ ID NO:77. Within a given serotype, small variations in the primary amino acid 
sequence of the botulinal toxins isolated from different strains has been reported [Whelan ct 
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at. (1992). supra and Minton (1995) Curr. Top Microbiol. Immunol. 195:161-194] The 
present invention contemplates fusion proteins comprising the receptor-bindinu domain of f 
ho.ulnn,,,, toxins from serotypes A-G including the variant, found among differen, strains 
wuhtn a yiV en serotype. The receptor-binding domains listed above arc used as the prototype 
«or each strain within a serotype. Fusion proteins containing an analogous region from a " 
stram other than the prototype strain are encompassed by the present invention. 

f usion proteins comprising the receptor bindmg domain </.,.. C fragment, of botulinal 
«oxms may include amino acid residues located beyond the termini of the domains defined 
"hove. 1 or example, the pHisBo.B protein contains amino acid residues 846-1291 of SI Q ID 
NO:40: this fusion protein thus comprises the receptor-binding domain (or < /„„„,/,„„„ type 
B lox.n as defined above ,/ He-848 through Cilu-129, ,. Similarly. pHisBoti: contains 
ammo acid residues 827-,252 of SHQ ,D NO:50 and pl.isBotCi contains amino acid residues 
8M-.297 ofSRO ,D N():77. Thus, both pHisBotH and pHisBo.C, fusion protems contain a" 
leu ammo actds located beyond the N-.erminus of the defined receptor-binding domain. 

The terms "native gene" or "native gene sequences" are used to indicate DIM A 
sequences encoding a particular gene which contain the same DNA sequences as found in the 
gene as .solated from nature. In contrast, "synthetic gene sequences" are DNA sequences 
wh.ch are used to replace the naturally occurring DNA sequences when the naturallv 
occurring sequences cause expression problems in a given host cell, for example, nalurallv- 
occumng DNA sequences encoding codons which are rarely used in a host cell mav be 
replaced by site-directed mutagenesis) such that the synthetic DNA sequence represents 

a more frequently used codon. The native DNA sequence and the synthetic DNA sequence 
will preferably encode the same amino acid sequence. 

SUMMARY OF THE INVENTION 

The present invention relates to the production of polypeptides derived from toxins 
parucularly in recombinant host cells. In one embodiment, the present mvention provides a 
host cell containing a recombinant expression vector, said vector encoding a protein 
comprising a, least a portion of a < tosnidiu,,, W,,,,,,,, toxin, said toxin selected from the 
group consisting of type B toxin and type E toxin. The present invention ,s not hmited bv 
Che nature of sequences encoding portions „f ,„ c C , /„„„/,„„„, toxin. These sequences mav be 
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derived from the native gene sequences or alternatively they may comprise synthetic gene 
sequences. Synthetic gene sequences are employed when expression of the native gene 
, sequences is problematic in a given host cell (e.g.. when the native gene sequences contain 
sequences resembling yeast transcription termination signals and the desired host cell is a 
yeast cell). 

In one embodiment, the host cell is capable of expressing the recombinant C 
hoiuiinum toxin protein at a level greater than or equal to 2% to 40% of the total cellular 
protein and preferably at a level greater than or equal' to 5% of the total cellular protein. In 
another embodiment, the host cell is capable of expressing the recombinant C homtmum 
toxin protein as a soluble protein at a level greater than or equal to 0.25% of the total cellular 
protein and preferably at a level greater than or equal to 0.25% to 10% of the total cellular 
, protein. 

The present invention is not limited by the nature of the host cell employed for the 
production of recombinant ( * hoiuiinum toxin proteins. In a preferred embodiment, the host 
cell is an coii cell. In another preferred embodiment, the host cell is an insect cell; 
particularly preferred insect host cells are Spodoptcra Jrugipertia (SfV) cells. In another 
preferred embodiment, the host cell is a yeast cell: particularly preferred yeast cells arc Pichia 
pustoris cells. 

In another embodiment, the invention provides a host cell containing a recombinant 
expression vector, said vector encoding a fusion protein comprising a non-toxin protein 
sequence and at least a portion of a ( los/ridium hoiuiinum toxin, said toxin selected from the 
group consisting of type 13 toxin and type [£ toxin. The invention is not limited by the nature 
ol the portion of the Clostridium hoiuiinum toxin selected- In a preferred embodiment, the 
portion of the toxin comprises the receptor binding domain c/.t\. the C fragment of the toxin). 
The present invention is not limited by the nature of the non-toxin protein sequence 
employed. In a preferred embodiment, the non-toxin protein sequence comprises a poly- 
histidine tract. A number of alternative fusion tags or fusion partners are known to the art 
U'.t,'.. MB P. (iST. protein A. etc.) and may be employed for the production of fusion proteins 
comprising a portion of a botulinal toxin. 
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The present invention further provides a vaccine comprising a fusion protein said 
lus,on protein comprising a non-toxin protein sequence and at leas, a portion of a Clo«rU ium 
hotuhmuu toxin, said toxin selected IVom the group consisting of type B toxin and tvpe F 
toxin. The vaccine may be a monovalent vaccine (/.,.. containing onlv a toxm B fusion 
protein or a toxin E fusion protein,, a bivalent vaccine . contain.ng both a toxin B fusion 
protem and a toxin E fusion protem, or a trivalen, or higher valencv vaccine. |„ a pre(crrcd 
embodiment, the toxin B fusion protein and/or toxin E fusion protein is combined with a 
»us.o„ protein comprising a non-toxin .protein sequence and at least a portion of CouriJiun, 
W/„„„, type A toxin. The present mvention is not limited by the nature of the portion of 
the ( /„.„,-,,/,,„„ W /w/w toxin ,„ a pref . erred embodjment |hc por( . on of u xjn 

eompnses the receptor binding domain a,., the C fragment of the toxin). The present 
mvention is not limited by the nature of the non-toxin protein sequence cmploved. In a 
pre.erred embodiment, the non-toxin protein sequence comprises a polv-histidinc tract V 
number ol alternative fusion tags or fusion partners are known to the ar, w.x.. MBP. ( iS , 
protein A. etc. ) and may be employed lor the generation of fusion proteins comprising 
vaccines.. When a fusion partner (i.e.. the non-toxin protein sequence, is emploved for the 
production of a recombinant ( . iuuulinal toxin protein. , he fusion partner mav be removed 
from the recombinant C. Ho.ulinal toxin protein if desired </.,.. prior to administration of the 
protem to a subject, using a variety of methods known to the ar, (, .... digestion of fusion 
proteins comaining I actorXa or thrombin recognition sites with the appropriate cnzvme,. A 
number of the pliTHis v ectors employed herein provide an N-terminal his-tag followed bv a 
l acorXa cleavage site (see Example 2Xa>: the botulinal C fragment sequences follow the 
I actorXa site and thus. l actorXa can be used lo remove the his-tag from the botulinal fusion 
protein. In a preferred embodiment, thc vaccine is substantially endotoxin-free. 

The present invention is not limited by the method employed for thc generation of 
vaccine comprising fusion proteins comprising a non-toxin protein sequence and at leas, a 
portion of a ( W/c//,,,„ />„,„/,,„„„ loxin . Thc fusioil proteins mfly ^ produccd by 
recombinant DNA means using either native or synthetic gene sequences expressed in a host 
eell. The present invention is not limited to thc production of vaccines using recombinant 
host cells: cell free in vim, transcription/translation systems may be employed for the 
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expression of the nucleic acid constructs encoding the fusion proteins of the present invention. 
An example of such a cell-free system is the commercially available TnT™ Coupled 
Reticulocyte Lysate System (Promega Corporation. Madison. Wl). Alternatively, the fusion 
proteins of the present invention may he generated by synthetic means (i.e.. peptide 
synthesis). 

The present invention further provides a method of generating antibody directed 
against a Clostridium hotulinum toxin comprising: a) providing in any order: i) an antigen 
comprising a fusion protein comprising a non-toxin protein sequence and at least a portion of 
a Clostridium hotulinum toxin, said toxin selected from the group consisting of type 13 toxin 
and type ii toxin, and ii) a host: and b) immunizing the host with the antigen so as to 
generate an antibody. In a preferred embodiment, the antigen used to immunize the host also 
, contains a fusion protein comprising a non-toxin protein sequence and at least a portion of 
Clostridium hotulinum type A toxin, t he present invention is not limited by ihe nature of the 
, portion of the ( tostridium hotulinum toxin selected. In a preferred embodiment, the portion 
of the toxin comprises the receptor binding domain {i.e.. the C fragment of the toxin). The 
prcscm invention is not limited by the nature of the non-toxin protein sequence employed. In 
a preferred embodiment, the non-toxin protein sequence comprises a poly-histidinc tract. A 
number of alternative fusion lags or fusion partners arc known to the art <e.,t;.. MBP. GST. 
protein A. etc.) and may be employed for the generation of fusion proteins comprising 
vaccines. When a fusion partner </.t\. the non-toxin protein sequence) is employed for the 
production of a recombinant C hotutimtl toxin protein, the fusion partner may be removed 
Irom tiie recombinant C hotulinul toxin protein if desired (/.<•.. prior to administration of the 
protein to a subject) using a variety of methods known to the art a-.*., digestion of fusion 
proteins containing FactorXa or thrombin recognition sites with the appropriate enzyme). 

The present invention is not limited by the nature of the host employed for the 
production of the antibodies of the invention. In a preferred embodiment, the host is a 
mammal, preferably a human. The antibodies of the present invention may be generated 
using non-mammalian hosts such as birds, preferably chickens. In a preferred embodiment 
the method of the present invention further comprised the step c) of collecting the antibodies 
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fiom die host, in vet another embodiment, the method of the present ' invention further 
comprises the step d) of purifying the antibodies., 

The present invention further provides antibodies raised according to the above 
methods. 

The present invention further contemplates multivalent vaccines comprising at least 
uvo recombinant < ; /„„„/, m „„ toxin prmeins derjved fmm ^ ^ J ^ 

Minm ' °- C °- E - F " ™* * The invention contemplates biva.cn, trivalen, 

uuadrava.ent. pcntavalen,. heptava.ent and septivalen, vaccines comprising recombinant ( 
A"«rfW toxin protems. Preferably the recombinant C. W,™,, l0 xi„ protem comprise, 
the receptor binding domain tie.. C fragment) of the toxin. 

DESCRIPTION OF THE INVENTION 

The present invention contemplates vaccinating humans and other animals with 
polypeptides derived from ('. h o,ulun m neurotoxins which are sub.stantia.lv endotoxin- free 
These botu.ina. peptides are also useful for the production of antitoxin. Ami-botulinal toxin 
unumxin ,s useful for the treatment of patients effected by or at risk of svmptoms due to the 
action ol ( • /„„„/,„,„„ loxin5 , The oruanisms loxjns apd jnd . v . dua| s(cps of ^ presciji 
invention ure described separately below. 

I- Clostridium Species, Clostridial Diseases And Associated Toxins 

A preferred embodiment of the method of the present invention ,s directed toward 
obtaining antibodies against ClovmUunr species, their toxins, enzymes or other metabolic bv- 
products. cell wall components, or synthetic or recombinant versions of am of these 
compounds, It is contemplated that these antibodies will be produced bv immunization of 
humans or other animals. I, is not intended that the present invention be limited to anv 
particu.ar toxin or any species of organism. In one embodiment, toxins from all ( WfcfiW 
spec.es arc contemplated as immunogens. Cxamp.es of these toxins mclude ihc neuraminidase 
toxin ol C. Inayricunt. C . sordellii toxins II I and I. I . toxins A. H. C. D. \ L ,.. and (i of r 
Wi„„ w and the numerous ( perfr^ens toxins . hl unc p ,. L . tcrrcd cmoodimcnI . U)xins A 
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13. and E of C hotulinum are contemplated as tmmunogens Table 2 above lists various 
Clostridium species, their toxins and some antigens associated with disease. 

■ •. : i 

TABLE 2 



Organism 


Toxins and Disease-Associated Antieens 


( htiiutinttm 


A. B. C„ C\. D. E. F. G 


< \ httivrtcUM 


Neuraminidase 


C difficile 


A. B. Enierotoxin <noi A nor B). Motility Altering Factor. Low 
Molecular Weight Toxin. Others 




a. P. c. i. y. 6. v. 0. k\ jt. o 


i\ Minfclh 

i ' hifci'incntans 


HT. IX a. P. y 


( '. iittryi 


a. |J. y. 6. v. 0 


< ' SC$NiVHlH 


a. p. y. o • 


< '. titstolyuctttn 


a. p. y. *v c plus additional enzymes 


{ . i /itHtV4H. t i 


a. (1. y. o 



ll is not intended that antibodies produced against one toxin will only be used against 
that toxin., It is contemplated that antibodies directed against one toxin U\.c\. ( '. perfrin^ens 
type A enierotoxin) may be used as an effective therapeutic against one or more toxin(s) 
produced by other members of the genus Clostridium or oilier toxin producing organisms 
U\.t:.. Bacillus cereus. Staphylococcus aureus. Streptococcus notions. Acinetolutcter 
calcoaccttcus. Pseudomonas aeruginosa, other Pseudomonas species, etc.). It is further 
contemplated thai antibodies directed against the portion of the toxin which binds to 
mammalian membranes <c\t:.. (*. perfrin^ens enierotoxin A) can also be used against other 
organisms. It is contemplated that these membrane binding domains are produced 
synthetically and used as immunotiens. 



II- Obtaining Antibodies In Non-Mammals 

A preferred embodiment of the method of the present invention for obtaining 
amibodies involves immunization. However, it is also contemplated that antibodies could be 
obtained from non-mammals without immunization. In the case where no immunization is 
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contemplated, the present invention may use non-mammals with preexisting antibodies to 
toxins as well as non-mammals that have antibodies to whole organisms by virtue of reactions 
with the administered antigen. An example of the latter involves immunization with synthetic 
peptides or recombinant proteins sharing epitopes with whole organism components. 

In a preferred embodiment, the method of the present invention contemplates 
immunizing non-mammals with bacterial toxin(s,. It is not intended that the present invention 
he limited to any particular toxin. In one embodiment, toxin from all clostridial bacteria 
sources (aw Table 2) are contemplated as immunogens. Examples of these toxins are C 
hutyricum neuraminidase toxin, toxins A. B. C. D. E. F. and Ci from ( . W„„,,„ . 
C. per/Hnficnx toxins «. p. r.. and t. and C. sordelUi toxins MT and I T. In a preferred 
embodiment. C. hon.fimov toxins A. B. C. 13. E. and F (or fragments thereof) are 
contemplated as immunogens. 

A particularly preferred embodiment involves the use of bacterial toxin protein or 
fragments of toxin proteins produced by molecular biological means </ e . recombinant toxin 
proteins). In a preferred embodiment, the immunogen comprises the receptor-binding domain 
{i.e.. the - 50 kl) carboxx -terminal portion of the heavy chain: also referred to as the (.' 
fragment) of C. hniulinum serotype A neurotoxin produced by recombinant DNA technology. 
In another preferred embodiment, the immunogen comprises the receptor-binding domain of 
C. hamlinum serotype B neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
of r hotulbmm serotype F. neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
ol C h<n til mum serotype CI neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
of C. hotutimnn serotype C2 neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-hinding domain region 
of r. hotulinum serotype D neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
of C homlinum serotype F neurotoxin produced by recombinant DNA technology. In yet 
another preferred embodiment, the immunogen comprises the receptor-binding domain region 
ol C . hoiulinum serotype G neurotoxin produced by recombinant DNA technology. In a 
preferred embodiment, the recombinant botulinal toxin proteins are expressed as fusion 
proteins ic.fi.. as histidine-lagged proteins). In a still further preferred embodiment, the 
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immunogcn is a- multivalent vaccine comprising the receptor-binding domain region of C. 
botulinum toxin from two or more toxins selected from the group consisting of type A. type 
B, type C (including CI and C2). type D. type E. and type F toxin. 

When immunization is used, the preferred non-mammal is from the class Arcs, All 
birds are contemplated (e x.. duck, ostrich, emu. turkey, etc.). A preferred bird is a chicken. 
Importantly, chicken antibody does not tlx mammalian complement. [See H.N. Benson el <//.. 
J. Immunol. 87:616 ( 1961 ) J Thus, chicken antibody will normally not cause a complement- 
dependent reaction. (A. A. Benedict and K. Yamaga. "Immunoglobulins and Antibody 
Product ion in Avian Species. " in Comparative Immunology ("J J. Marchaloni. ed.). pp. 3.15- 
375. BlackwelL Oxford ( I966).| Thus, the preferred antitoxins of the present invention will 
not exhibit complement-related side effects observed with antitoxins known presently. 

When birds are used, it is contemplated that the antibody will be obtained from either 
the bird serum or the egg. A preferred embodiment involves collection of the antibody from 
the egg: Laying hens transport immunoglobulin to the egg yolk ("IgY") in concentrations 
equal to or exceeding that found in scrum. {See R. Patterson et at., }. Immunol. 89:272 
( 1962): and S.B. Carroll and B.D. Slollar. J. Biol. Chem. 258:24 (I983>.| In addition, the 
large volume of egg yolk produced vastly exceeds the volume of serum that can be safely 
obtained from the bird over any given time period. Finally, the antibody from eggs is purer 
and more homogeneous: there is far less non-immunoglobulin protein (as compared to serum) 
and only one class of immunoglobulin is transported to the yolk. 

When considering immunization with toxins, one may consider modification of the 
toxins to reduce the toxicity. In this regard, it is not intended that the present invention be 
limited 'by immunization with modified toxin. Unmodified ("native") toxin is also 
contemplated as an immunogen. 

It is also not intended that the present invention be limited by the type of modification 
-- if modification is used. The present invention contemplates all types of toxin modification, 
including chemical and heat treatment of the toxin. The preferred modification, however, is 
formaldehyde treatment. 

It is not intended that the present invention be limited to a particular mode of 
immunization: the present invention contemplates all modes of immunization, including 
subcutaneous, intramuscular, intraperitoneal, and intravenous or intravascular injection, as well 
as per o\ administration of immunogen. 
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The present invention further contemplates immunization with or without adjuvant. 
(Adjuvant is defined as a substance known to increase the immune response to other antigens 
when administered with other antigens.) If adjuvant is used, it is not iritended that the present 
invention be limited to any particular type of adjuvant « or that the same adjuvant, once used, 
be used all the time. While the present invention contemplates all types of adjuvant, whether 
used separately or in combinations, the preferred use of adjuvant is the use of Complete 
Freund s Adjuvant followed sometime later with Incomplete i reund s Adjuvant. Another 
preferred use of adjuvant is the use of Gerbu Adjuvant. Hie invention also contemplates the 
use of RIBI fowl adjuvant and Quil A adjuvant. 

When immunization is used, the present invention contemplates a wide variety of 
immunization schedules. In one embodiment, a chicken is administered toxin(s) on day zero 
and subsequently receives toxin(s) in intervals thereafter. It is not intended that the present 
invention be limited by the particular intervals or doses. Similarly, it is not intended that the 
present invention be limited to any particular schedule for collecting antibody. The preferred 
15 collection time is sometime after day 100. 

Where birds are used and collection of antibody is performed by collecting cutis, the 
eggs may be stored prior to processing for antibody. It is preferred that eggs be stored at 4°C 
for less than one year. 

It is contemplated that chicken antibody produced in this manner can be buffer- 
20 extracted and used analytically. While unpurified. this preparation can serve as a reference 
for activity of the antibody prior to further manipulations (e.g.. immunoafTinity purification). 

III. Increasing The Effectiveness Of Antibodies 

When purification is used, the present invention contemplates purifying to increase the 
25 cltecliveness ol both non-mammalian antitoxins and mammalian antitoxins. Specifically, the 
present invention contemplates increasing the percent of toxin-reactive immunoglobulin.' The 
prelerred purification approach for avian antibody is polyethylene glycol (PEG) separation. 

The present invention contemplates that avian antibody be initially purified using 
simple, inexpensive procedures. In one embodiment, chicken antibody from eggs is purified 
by extraction and precipitation with PEG. PEG purification exploits the differential solubility 
ol lipids (which are abundant in egg yolks) and yolk proteins in high concentrations of PUG 
8000. [Poison a «/.. Immunol. Comm. 9:495 (I9K0).J The technique is rapid, simple, and 
relatively inexpensive and yields an immunoglobulin fraction that is significantly purer in 
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terms of contaminaiing non-immunoglobulin proteins than the comparable ammonium sulfate 
fractions of mammalian sera and horse antibodies. The majority of the PEG is removed from 
the precipitated chicken immunoglobulin bv treatment with ethanol. 'indeed. PEG-purilled 
antibody is sufficiently pure that the present invention contemplates the use of PEG-purified 
antitoxins in the passive immunization of intoxicated humans and animals. 

IV. Treatment 

The present invention contemplates antitoxin therapy for humans and other animals 
intoxicated by bacterial toxins. A preferred method of treatment is by intravenous 
administration of anti-boutlinal antitoxin: oral administration is also contemplated for other 
clostridial antitoxins. 

A. Dosage Of Antitoxin 

It was noted by way of background that a balance must be struck when administering 
currently available antitoxin which is usually produced in large animals such as horses: 
sufficient antitoxin must be administered to neutralize the toxin, but not so much antitoxin as 
to increase the risk of untoward side effects. These side effects are caused by: i) patient 
sensitivity to foreign <c,^ horse) proteins: ii) anaphylactic or immunogenic properties of non- 
immunoglobulin proteins: iii) the complement fixing properties of mammalian antibodies: 
and/or ivj.ihc overall burden of foreign protein administered. It is extremely difficult to 
strike this balance when, as noted above, the degree of intoxication (and hence the level of 
antitoxin therapy needed) can only be approximated. 

I he present invention contemplates significantly reducing side effects so that this 
balance is more easily achieved. Treatment according to the present invention contemplates 
reducing side effects by using PEG-purilied antitoxin from birds. 

In one embodiment, the treatment of the present invention contemplates the use of 
PI^G-purified antitoxin from birds. The use of yolk-derived. PEG-purfficd antibody as 
antitoxin allows lor the administration of: I) non(mammalian)-complemcnt-nxing. avian 
antibody: 2) a less heterogeneous mixture of non-immunoglobulin proteins: and 3) less total 
protein to deliver the equivalent weight of active antibody present in currently available 
antitoxins. The non-mammalian source of the antitoxin makes it usef ul for treating patients 
who are sensitive to horse or other mammalian sera. 
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B. Delivery Of Antitoxin 

Although it is not intended to limit the route of delivery, the present invention 
contemplate, a method for antitoxin treatment of bacterial intoxication in which delivery of 
antitoxin is oral. In one embodiment, antitoxin is delivered in a solid form tablets,. | n 

an alternative embodiment antitoxin is delivered in an aqueous solution. When an aqueous 
solution is used, the solution has sufficient ionic strength to solubilizc antibodv protein, vet is 
made palatable tor oral administration. The delivery solution mav also be buffered U< K 
carbonate buffer p. I 9. 5 ) which can neutralize stomach acids and stabilize the antibodies when 
the anubod.es are administered orally. In one embodiment the delivery solution is an aqueous 
solution: In another embodiment the delivery solution is a nutritional formula. Prelerablv 
the delivery solution is infant formula. Yet another embodiment contemplates the deliverv of 
lyophilized antibody encapsulated or microencapsulated inside acid-resistant compounds. " 

Methods of applying enteric coatings to pharmaceutical compounds are well known to 
the art |companies specializing in the coating of pharmaceutical compounds are available: for 
example. The Coating Place (Verona. Wl) and AAI (Wilmington. NC)). faerie coatinits 
which are resistant to gastric fluid and whose release (/.,.. dissolution of the coating to release 
the pharmaceutical compound) is pH dependent are commercially available |for example, the 
polymethaerylates Ludragit* I. and KudragitUO S (Rohm GmbH>|. Hudragit.K. S is soluble in 
intestinal lluid from pi I 7.0; this coating can be used to microencapsulate .yophi.izcd antitoxin 
antibodies and the particles are suspended in a solution having a pll above or below p|| 7.0 
lor oral administration. The microparticles will remain intact and undissolved until thev 
reached the intestines where .he intestinal P H would cause them to dissolve .hereby releasing 
the antitoxin. 

The invention contemplates a method of treatment which can be administered for 
treatment of acute intoxication. In one embodiment, antitoxin is administered orally in either 
a delivery solution or in tablet form, in therapeutic dosage, to a subject intox.cated by the 
bacterial toxin which served as immunogen for the antitoxin. 

The invention also contemplates a method of treatment which can be administered 
prophylaciically. In one embodiment, antitoxin is administered orally, in a delivery solution, 
in therapeutic dosage, to a subject, to prevent intoxication of the subject by the bacterial toxin 
which served as immunogen for the production of antitoxin. | n another embodiment, 
antitoxin is administered orally in solid form such as tablets or as microencapsulated particles. 
Microencapsulation of lyophilized antibody using compounds such as liudragit® (Rohm 
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GmbH) or polyethylene glycol . which dissolve at a wide range of pH units, allows the oral 
administration of solid antitoxin in a liquid form (i.e.. a suspension) to recipients unable to 
tolerate administration of tablets children or patients on feeding 1 tubes). In one preferred 
embodiment the subject is a child. In another embodiment, antibody raised against whole 
bacterial organism is administered orally to a subject, in a delivery solution, in therapeutic 
dosage. 

V. Vaccines Against Clostridial Species 

The invention contemplates the generation dfmonu- and multivalent vaccines for the 
protection of an animal (particularly humans) against several clostridial species. Of particular 
interest are vaccines which stimulate the production of a humoral immune response to ( \ 
hoiulimtm. i \ teumi and C difficile in humans. The antigens comprising the vaccine 
preparation may he native or recombinant^ produced toxin proteins from the clostridial 
species listed above. When toxin proteins are used as immunogens they are generally 
modified to reduce the toxicity. This modification may be by chemical or genetic (i.e.. 
recombinant DNA technology) means. In general genetic detoxification (i.e.. the expression 
of nontoxic fragments in a host cell) is preferred as the expression of nontoxic fragments in a 
host cell precludes the presence of intact, active toxin in the final preparation. However, 
when chemical modification is desired, the preferred toxin modification is formaldehyde 
20 treatment. 

The invention contemplates that recombinant C. hatulimmi toxin proteins be used as 
antigens in mono- and multivalent vaccine preparations. Soluble, substantially endotoxin- free 
recombinant ( \ hotniimtm toxin proteins derived from serotypes A. B and E may be used 
individually {i.e.. as mono-valent vaccines) or in combination (#.t\. as a multi-valent vaccine). 
In addition, the recombinant (\ hotutimtm toxin proteins derived from serotpes A. B and F. 
may be used in conjunction with either recombinant or native toxins or toxoids from other 
serotypes of C. boiulinunu C. difficile and C. letcmi as antigens for the preparation of these 
mono- and multivalent vaccines. It is contemplated that, due to the structural similarity of <\ 
hoiulinum and C. teumi toxin proteins, a vaccine comprising L\ difficile and hotidimim toxin 
proteins (native or recombinant or a mixture thereof be used to stimulate an immune 
response against C. botulinum, t\ feutni and C. difficile. 
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The present invention further contemplates rhulti-valent vaccines comprising two or 
more botulinal toxin proteins selected from the group comprising recombinant C. hotulinum 
toxin proteins derived from serotypes A. B. C (including CI and C2). D. E. F and G. 

Hie adverse consequences ol" exposure to botulinal toxin would be avoided by 
immunization of subjects at risk of exposure to the toxin with nontoxic preparations which 
confer immunity such as chemically or genetically detoxified toxin. 

Vaccines which confer immunity against one or more of the toxin types A. B. F.. I 
and G would be useful as a means of protecting humans from the deleterious effects of those 
( V hotulinum toxins known to affect man. Indeed as the possibility exists that humans could 
be exposed to any of the seven serotypes of C. hotulinum toxin {e.ft,. during biological 
warfare or the production of toxin in a laboratory setting), multivalent vaccines capable of 
conferring immunity against toxin types A-G (including both CI and C2 toxins) would be 
useful for the protection of humans. Vaccines which confer immunity against one or more of 
the toxin -types C. D and F. would be useful for veterinary applications. 

The botulinal neurotoxin is synthesized as a single polypeptide chain which is 
processed into a heavy (H: -100 kD) and a light (L: -50 kD) chain by cleavage with proteolytic 
enzymes: these two chains are held together via disulfide bonds in the active toxin (referred to 
as derivative toxin) [B R. DasGupta and 11. Sugiyama. Bipchcm. Biophys. Res. Commun. 
48:108 (1972): reviewed in B.R. DasGupta. J. Physiol. 84:220 (1990). H. Sugiyama. 
Microbiol. Rev. 44:419 (1980) and C.L. Hatheway. Clin. Microbiol. Rev. 3:66 (1990)] The 
heavy chain of the active toxin is cleaved by trypsin to produce two fragments termed 11, 
(also referred to as H, or C) and 11 N (also referred to as II, or B). The 11, fragment (-46 kD) 
comprises the carboxy end of the II chain! The H v fragment (-49 kD) comprises the animo 
end and remains attached to the L chain (H N L). Neither H t or H N I. is toxic. 11, competes 
with whole derivative toxin for binding to synaptosomes and therefore 1 1, is said to contain 
the receptor binding site. The 1 1,. and H„ fragments of botulinal toxin are analogous to the 
fragments C and B of tetanus toxin which are produced by papain cleavage. The C fragment 
of tetanus toxin has been shown to be responsible for the binding of tetanus toxin to purified 
gangliosides and neuronal cells (Halpern and Loftus. J. Biol. Chem. 288:1 1 188 (1993)]. 

Antisera raised against purified preparations of isolated botulinal I I and I. chains have 
been shown to protect mice against the lethal effects of the toxin: however, the effectiveness 
of the two antisera differ with the anli-H sera being more potent (II. Sugiyama. supra). 
While the different botulinal toxins show structural similarity to one another, the different 
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serotypes are reported to be immunologically distinct (/ c, sera raised against one toxin type 
does not cross-react to a significant degree with other types). Thus, the generation of 
multivalent vaccines may require the use of more than one type of toxin. 

C. hondinum toxin genes from all seven serotypes have been cloned and sequenced 
(Minion (1995). supra): in addition, partial amino acid sequence is available for a number of 
('. hotulinum toxins isolated from different strains within a given serotype. The C. hot ul mum 
toxins contain about 1250-1300 amino acid residues. On the DNA level, the overall degree of 
homology between C. hotulinum serotypes A, B. C\ D and E toxins averages between 50 and 
60% identity with a greater degree of homology being found between H chain-encoding 
regions than between those encoding L chains [Whelan et al. (1992) Appl. Environ. 
Microbiol. 58:2345}. The degree of identity between C. hotulinum toxins on the amino acid 
level reflects 'the level of DNA sequence homology. The most divergent area of DNA and 
amino acid sequence is found within the carboxy-terminal area of the various C* hotulinum \ \ 
chain genes. This portion of the toxin (i.e.. H r or the C fragment) plays a major role in cell 
binding. As toxin from different serotypes is thought to bind to distinct cell receptor 
molecules, it is not surprising that the toxins diverge significantly over this region. 

Within a given serotype, small variations in the primary amino acid sequence of the 
botulinal toxins isolated from different strains has been reported | Whelan et al. { 1992). supra 
and Minion (1995). supra]. The present invention contemplates fusion proteins comprising 
portions of C hotulinum toxins from serotypes A-G including the variants found among 
different strains within a given serotype. The present invention provides oligonucleotide 
primers which may be used to amplify the C fragment or receptor-binding region of the toxin 
gene from various strains of C hotulinum serotype A. serotype B. serotype C (CI and C2). 
serotype I), serotype E. serotype F and serotype G. A large number of different strains of (*. 
hotulinum serotype A. serotype B. serotype C. serotype D serotype E and serotype F are 
available from the American Type Culture Collection (ATCC: Rockville. MD). For example, 
the ATCC provides the following: Type A strains: 174 (ATCC 3502). 457 (ATCC 17862). 
and NCTC 7272 (ATCC 19397); Type B strains: 34 (ATCC 439). 62 A (ATCC 7948). NCA 
213 B (ATCC 7949), 13114 (ATCC 8083). 3137 (ATCC 17780). 1347 (ATCC 17841). 2017 
(ATCC 17843). 2217 (ATCC 17844). 2254 (ATCC 17845) and VP 1731 (ATCC 25765): 
Type C strains: 2220 (ATCC 17782). 2239 (ATCC 17783). 2223 (ATCC 17784: a type C-p 
strain: C\p strains produce C2 toxin). 662 (ATCC 17849; a type C-ot strain; C-cx strains 
produce mainly CI toxin and a small amount of C2 toxin). 2021 (ATCC 17850: a type C-ct 
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strain) and y PI 3803 (ATCC 25766): Type. D strains: ATCC 9633. 2023 (ATCC 17851). and 
VPI 5995 (ATCC 27517); Type li strains: ATCC 43181. 36208 (ATCC 9564) 2231 (ATCC 
1 7786). 2229 (ATCC 17852). 2279 (ATCC 17854) and 2285 (ATCC (7855) and Type F 
strains: 202F (ATCC 23387). VPI 4404 (ATCC 25764). VPI 2382 (ATCC 27321) and 
Lanyeland (ATCC 35415). Type G strain. 1 13/30 (NCFB 3012) may be obtained from the 
National Collection of Food Bacteria (NCFB. AFRC Institute of Food Research. Reading. 
United Kingdom). 

Purification, methods have been reported lor native toxin types A. B. C. D. E. and F 
|rcvicwed in G. Sakaguchi. Pharmac. Ther. 19:165 (I983)|. As the different botulinal toxins 
are structurally related, the invention contemplates the expression of any of the botulinal 
toxins icy., types A-G) as soluble recombinant fusion proteins: 

In particular, methods for purification of the type A botulinum neurotoxin have been 
developed JI..J. Moberg and H. Sugiyama. Appl. linviron. Microbiol. 35:878 (I978)|. 
Immunization of hens with detoxified purified protein results in the generation of neutralizing 
antibodies |B,S. Tlialley et at., in Botulinum and Tetanus Neurotoxins. B.R. DasGupta. ed 
Plenum Press. New York (1993). p. 467]. 

1 he currently available C. botulinum pentavalenl vaccine comprising chemically 
inactivated </.<.■.. formaldehyde treated) type A. B. C. D and F. toxins is not adequate. The 
efficacy is variable (in particular, only 78% of recipients produce protective levels of ami-type 
U antibodies following administration of the primary scries) and immunization is painful 
(deep subcutaneous inoculation is required for administration), with adverse reactions being 
common (moderate to severe local reactions occur in approximately 6% of recipients upon 
initial injection: this number rises to approximately I 1% of individuals who receive booster 
injections) (Informational Brochure for the Pcntavalent (ABCDI-) Botulinum Toxoid. Centers 
lor Disease Control |. Preparation of this vaccine is dangerous as active toxin must be handled 
by laboratory workers. 

In general, chemical detoxification of bacterial toxins using agents such as 
formaldehyde, glutaraldehyde or hydrogen peroxide is not optimal for the generation of 
vaccines or antitoxins. A delicate balance must be struck between too much and too little 
chemical modification. If the treatment is insufficient, the vaccine may retain residual 
toxicity. If the treatment is too excessive, the vaccine may lose potency due to destruction of 
native immunogenic determinants. Another major limitation of using botulinal toxoids for the 
generation of antitoxins or vaccines is the high production expense. For the above reasons. 
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the development of methods for the production of nontoxic but immunogenic C. boiulinum 
toxin proteins is desirable. 

The C boiulinum and C tetanus toxin proteins have similar structures [reviewed in 
F..J. Schantz and E.A. Johnson. Microbiol. Rev. 56:80 ( 1992) J. The carboxy-terminal 50 kD 
fragment of the tetanus toxin heavy chain (fragment C) is released by papain cleavage and has 
been shown to be non-toxic and immunogenic. Recombinant tetanus toxin fragment C has 
been developed as a candidate vaccine antigen [A.J. Makofl et al.. Bio/Technology 7:1043 
( I989)|. Mice immunized with recombinant tetanus toxin fragment C were protected from 
challenge with lethal doses of tetanus toxin. No studies have demonstrated that the 
recombinant tetanus fragment C protein confers immunity against other botulinal toxins such 
as the C hotulinum toxins. 

Recombinant tetanus fragment C has been expressed in E. coll (A.J. Makoff et at.. 
Bio/Technology, supra and Nucleic Acids Res. 17:10191 (1989): J.I.. Halpern et al.. Infect. 
Immun. 58. 1004 (1990)J. yeast f M.A. Romanos et al.. Nucleic Acids Res. 19:1461 (I99i)| 
and baculovirus [I.G. Charles et al.. Infect. Immun. 59.1627 (1-991 )|. Synthetic tetanus toxin 
genes had to be constructed to facilitate expression in yeast (M.A. Romanos et al.. supra) and 
E. eoli f A..I. Makoff et al.. Nucleic Acids Res.. supra], due to the high A- T content of the 
tetanus toxin gone sequences. High A-T content is a common feature of clostridial genes 
(M.R. Popoff c/ al.. Infect. Immun. 59:3673 (1991); H.F. LaPenotierc et al.. in Botulinum and 
20 Tetanus Xeuroioxtns. BR. DasGupta. cd.. Plenum Press. New York (1993). p. 463 1 which 
creates expression difficulties in E. eo/i and yeast due primarily lo altered codon usage 
frequency and fortuitous polyadem lalion sites, respectivclv. 

The C fragment of the C. boiulinum type A neurotoxin heavy chain has been evaluated 
as a vaccine candidate. The C. boiulinum type A neurotoxin gene has been cloned and 
sequenced [D.li. Thompson et al.. Eur. J. Biochem. 189:73 (I990)J. The C fragment of the 
type A toxin was expressed as either a fusion protein comprising the botulinal C fragment 
fused wiih the maltose binding protein (MBP) or as a native protein |H.F. l.aPenotiere el al.. 
(1993) supra. II.F. LaPenotiere ei al.. Toxicon. 33: 1 383 ( 1995) and Middlebrook and Brown 
(1995). Curr. Top. Microbiol. Immunol. 195:89-1221. The plasmid construct encoding the 
native protein was reported to be unstable, while the fusion protein was expressed primarily in 
inclusion bodies as insoluble protein. Immunization of mice with crudely purilied MBP 
fusion protein resulted in protection against IP challenge with 3 LD<„ doses of toxin 
H-aPenotiere et al.. (1993) and (1995). supra). However, this recombinant C. hotulinum type 
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A toxin C fragment/MBP fusion protein is not a suitable immunogen for the production of 
vaccines as it is expressed as an insoluble protein in E. coli. Furthermore, this recombinant 
C. botulinum type A toxin C fragment/MBP fusion protein was not shown to be substantially 
free of endotoxin contamination. Experience with recombinant C. botulinum type A toxin C 
fragment/MBP fusion proteins shows that the presence of the MBP on the fusion protein 
greatly complicates the removal of endotoxin from preparations of the recombinant fusion 
protein (sec Ex. 24. infra). Expression of a synthetic gene encoding C botulinum type A 
toxin C fragment as a soluble protejn excreted from insect cells has been reported 
IMiddlcbrook and Brown (1995). supra]: no details regarding the level of expression achieved 
or the presence of endotoxin or other pyrogens were provided. Like the insoluble protein 
expressed in E. coli. immunization with the recombinant protein produced in insect cells was 
reported to protect mice from challenge with C. botulinum toxin A. 

Inclusion body protein must be solubilized prior to purification and/or administration 
to a host. The harsh treatment of inclusion body protein needed to accomplish this 
solubilization may reduce the immunogenicity of the purified protein. Ideally, recombinant 
proteins to he used as vaccines are expressed as soluble proteins at high levels </.<.... greater 
than or equal to about 0.75% of total cellular protein) in E. coli or other host cells (ex. 
yeast, insect cells, etc.). This facilitates the production and isolation of sufficient quantities of 
the immunogen in a highly purified form (i.e.. substantially free ol endotoxin or other 
pyrogen contamination). The ability to express recombinant toxin proteins as soluble proteins 
in E. coli is advantageous due to the low cost of growth compared hi insect or mammalian 
tissue culture cells. 

The C. botulinum type B neurotoxin gene has been cloned and sequenced from two 
strains of C botulinum type B [Whelan ct at. (1992) Appl. Environ. Microbiol. 58:2345 
(Danish strain) and Hutson at al. (1994) Curr. Microbiol. 28:101 (Eklund 17B strain)]. The 
nucleotide sequence of the toxin gene derived from the Eklund I7B strain (ATCC 25765) is 
available from the EMBL/GenBank sequence data banks under the accession number X7I343: 
the nucleotide sequence of the coding region is listed in SEQ ID NO:39. The amino acid 
sequence of the C. botulinum type B neurotoxin derived from the strain Eklund I7B is listed 
in SEQ ID NO:40. The nucleotide sequence of the C botulinum serotype B toxin gene 
derived from the Danish strain is listed in SEQ ID NO:4l. The amino acid sequence of the 
C. botulinum type B neurotoxin derived from the Danish strain is listed in SEQ ID NO:42. 
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The C. hotulinum type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds. The light chain is responsible for pharmacological activity (i.e.. inhibition of 
the release of acetylcholine at the neuromuscular junction). The N-terminal portion of the 
heavy chain is thought to mediate channel formation while the C-terminal portion mediates 
toxin binding: the type B neurotoxin has been reported to exist as a mixture of predominantly 
single chain with some double chain ( Whelan et «/.. supra). The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the 1 1,, domain. The present 
invention reports for the first time, the expression of the C fragment of C. hotulinum type B 
toxin in heterologous hosts E. co/i). 

The C. hotulinum type E neurotoxin gene has been cloned and sequenced from a 
number of different strains [Poulct et al. (1992) Biochem. Biophys. Res. Commun. 183:107: 
Whelan e, al. (1992) Eur. J. Biochenv 204:657: and Fujii etui. (1993) J. Gen. Microbiol. 
!39:79|. The nucleotide sequence of the type E toxin gene is available from the fiMBL 
sequence data bank under accession numbers X62089 (strain Beluga) and X62683 (strain 
NCIC 1 1219): the nucleotide sequence of the coding region (strain Beluga) is listed in Sf-Q 
ID NO:45. The amino acid sequence of the C hotulinum type E neurotoxin derived from 
sirain Beluga is listed in SEQ ID NO:46. The type E neurotoxin gene is synthesized as a 
single polypeptide chain which may be converted to a double-chain form (i.e.. a heavy chain 
and a light chain) by cleavage with trypsin: un|ike the type A neurotoxin, the type E 
neurotoxin exists essentially only in the single-chain form. The 50 kD carboxy-terminal 
portion of the heavy chain is referred to as the C fragment or the H,. domain. The present 
invention reports for the first time, the expression of the C fragment of C. hotulinum type E 
toxin in heterologous hosts (c .t:.. £. co/i). 

The C hotulinum type CI. D. F and G neurotoxin genes have been cloned and 
sequenced. The nucleotide and amino acid sequences of these genes and toxins are provided 
herein. The invention provides methods for the expression of the C fragment from each of 
these toxin genes in heterologous hosts and the purification of the resulting recombinant 
proteins. 

The subject invention provides methods which allow the production of soluble C. 
hotulinum toxin proteins in economical host cells {e.f>.. E. tali). In addition the subject 
invention provides methods which allow the production of soluble botulinal toxin proteins in 
yeast and insect cells. Further, methods for the isolation of purified soluble C. hotulinum 
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toxin proteins which are suitable for immunization of humans and other animals are provided. 
These soluble, purified preparations of C botulinum toxin proteins provide the basis for 
improved vaccine preparations and facilitate the production of 

When recombinant clostridial toxin proteins produced in gram-negative bacteria (c.^.. 
£. volt) are used as vaccines, they are purified to remove endotoxin prior to administration to 
a host animal. In order to vaccinate a host, an immunogcnically-cffective amount of purified 
substantially endotoxin-free recombinant clostridial toxin protein is administered in any of a 
number of physiologically acceptable carriers known to the art. When administered for the 
purpose of vaccination, the purified substantially endotoxin-free recombinant clostridial toxin 
protein may be used alone or in conjunction with known adjutants, including potassium alum, 
aluminum phosphate, aluminum hydroxide, Gerbu adjuvant (GmDP: C.C. Biotech Corp.), 
RIBI adjuvant (MPL; RIBI Immunochemical Research. Inc.). QS21 (Cambridge Biotech). 
The alum and aluminum-based adjutants arc particularly preferred when vaccines arc to be 
administered to humans; however, any adjuvant approved for use in humans may be 
employed. .The route of immunization may be nasal, oral, intramuscular, intraperitoneal or 
subcutaneous. 

The invention contemplates the use of soluble, substantially endotoxin-free 
preparations of fusion proteins comprising the C fragment of the ( botulinum type A. B C, 
D. E. F. and G toxin as vaccines. In one embodiment, the vaccine comprises the C fragment 
of either the C hotidinum type A. B. C, D. E. F. or G toxin and a poly-histidine tract (also 
called a histidine tag). In a particularly preferred embodiment, a fusion protein comprising 
the histidine tagged C fragment is expressed using the pET scries of expression vectors 
(Novagen). The pET expression system utilizes a vector containing the T7 promoter which 
encodes the fusion protein and a host cell which can be induced to express the T7 DNA 
polymerase (i.e.. a DE3 host strain). The production of C fragment fusion proteins containing 
a histidine tract is not limited to the use of a particular expression vector and host strain. 
Several commercially available expression vectors and host strains can be used to express the 
C fragment protein sequences as a fusion protein containing a histidine tract (For example, the 
pQE series (pQE-8. 12, 16, 17. 18. 30. 31. 32. 40. 41. 42. 50. 51. 52. 60 and 70) of 
expression vectors (Qiagcn) which are used with the host strains MI5fpREP4| (Qiagen) and 
SG13009[pREP4| (Qiagen) can be used to express fusion proteins containing six histidine 
residues at the amino-terminus of the fusion protein). Furthermore a number of commercially 
available expression vectors which provide a histidine tract also provide, a protease cleavage 
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site between the histidine tract and the protein of interest U'.jr. botulinal toxin sequences). 
Cleavage of the resulting fusion protein with the appropriate protease will remove the 
histidine tag from the protein of interest { e .g.. botulinal toxin sequences) (see Example 28x 
infra). Removal of the histidine tag may be desirable prior to administration of the 
recombinant botulinal toxin protein to a subject (e.g.. a human). 

VI. Detection Of Toxin 

The invention contemplates detecting bacterial toxin in a sample. The term ' sample" 
in the present specification and claims is used in its broadest sense. On the one hand it is 
meant to include a specimen or culture. On the other hand, it is meant to include both 
biological and environmental samples, 

Hiologica! samples may be animal, including human. Iluid. solid (e.g.; stool) or tissue: 
liquid and solid food products and ingredients such as dairy items, vegetables, meat and meat 
by-products, and waste. Environmental samples include environmental material such as 
surface matter, soil, water and industrial samples, as well as samples obtained from food and 
dairy processing instruments, apparatus, equipment, disposable and non-disposable items. 
These examples are not to be construed as limiting the sample types applicable to the present 



invention. 



The invention contemplates detecting bacterial toxin by a competitive immunoassay 
method that utilizes recombinant toxin A and toxin B proteins, antibodies raised against 
recombinant bacterial toxin proteins. A fixed amount of the recombinant toxin proteins are 
immobilized to a solid support (c.^.. a microliter plate) followed by the addition of a 
biological sample suspected of containing a bacterial toxin. The biological sample is first 
mixed with affinity-purified or VEG fractionated antibodies directed against the recombinant 
toxin protein. A reporter reagent is then added which is capable of detecting the presence of 
antibody bound to the immobilized toxin protein. The reporter substance may comprise an 
antibody with binding specificity for the antitoxin attached to a molecule which is used to 
identif y the presence of the reporter substance. If toxin is present in the sample, this toxin 
will compete with the immobilized recombinant toxin protein for binding to the anti- 
recombinam antibody thereby reducing the signal obtained following the addition of the 
reporter reagent. A control is employed where the antibody is not mixed with the sample. 
This gives the highest <or reference) signal. 
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The invention also contemplates detecting bacterial toxin "by a "sandwich" 
immunoassay method that utilizes antibodies directed against recombinant bacterial toxin 
proteins. Affinity-purified antibodies directed against recombinant bacterial toxin proteins arc 
immobilized to a solid support microliter plates). Biological samples suspected of 

containing bacterial toxins are then added followed by a washing step (o remove substantially 
all unbound antitoxin. The biological sample is next exposed to the reporter substance, which 
binds to antitoxin and is then washed free of substantially all unbound reporter substance. 
The reporter substance may comprise an antibody with binding specificity for the antitoxin 
attached to a molecule which is used to identify the presence of the reporter substance. 
Identillcation of the reporter substance in the biological tissue indicates the presence of the 
bacterial toxin. 

It is also contemplated that bacterial toxin be detected by pouring liquids u^.; soups 
and other tluid foods and feeds including nutritional supplements lor humans and other 
animals) over immobilized antibody which is directed against the bacterial toxin. |, is 
contemplated that the immobilized antibody will be present in or on such supports as 
cartridges, columns, beads, or any other solid support medium. In one embodiment, following 
the exposure of the liquid to the immobilized antibody, unbound toxin is subslamiallv 
removed by. washing. The exposure of the liquid .s then exposed to a reporter substance 
which detects the presence of bound toxin. In a preferred embodiment the reporter substance 
is an cnzyriic. fluorescent dye. or radioactive compound attached to an antibody which is 
directed against, the toxin (i.e.. in a "sandwich" immunoassay). It is also contemplated that 
.the detection system will be developed as necessary ,he addition. of enzyme substrate in 

enzyme, systems: observation using fluorescent light for fluorescent dye systems: and 
quantitation of radioactivity for radioactive systems). 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the disclosure which follows, the following abbreviations apply: "C (degrees 
Centigrade »: rpm (revolutions per minute): BBS-Tween (borate buffered saline containing 
Twecn): BSA (bovine serum albumin): KUSA (enzyme-linked immunosorbent assay): CT A 
(complete I reund s adjuvant): II A (incomplete I reund s adjuvant): IgCi (immunoglobulin G>: 
IgY (immunoglobulin Y): IM (intramuscular): ll> (intraperitoneal): IV (intravenous or 
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intravascular): SC ( subcutaneous) : H,0 (water); HCI (hydrochloric acid): LD, 0() (lethal dose 
lor 100% of experimental animals): aa (amino acid): HPLC (high performance liquid 
chromatography); kD (kilodaltons): gm (grams): ug (micrograms); mg (milligrams): ng 
(nanograms): nl (microliters): ml (milliliters): mm (millimeters): nm (nanometers): urn 
(micrometer): M , molar): mM (millimolar); MW (molecular weight): sec (seconds): min(s) 
(minute/minutes): hr<s) (hour/hours): MgCI, (magnesium chloride): NaCI (sodium chloride); 
NuX'O, (sodium carbonate): OD. xn (optical density at 280 nm): OD,,,,, (optical density at 600 
nm): PAGE (polyacrylamide gel electrophoresis): PBS (pliosphatc buffered saline (150 mM 
NaCI. 10 mM sodium phosphate buffer. pH 7.2)]: PEG (polyethylene glycol): PMSF 
(phenylmethylsulfonyl fluoride); SDS (sodium dodecyl sulfate): Tris 

(tris(hydro.\ymeih\l)aminomcthane): Ensure* (Ensure &. Ross Laboratories. Columbus OH); 
linfamil" < Enfamil « . Mead Johnson): w/v (weight to volume): v/v (volume to volume): 
Amicon (Amicon. Inc.. Beverly. MA): Amresco ( Amresco.' Inc.. Solon. Oil): ATCC 
< American Type Culture Collection. Rockville. MO): BBL (Baltimore Biologies Laboratory. 
ta division- of Becton Dickinson). Cockeysvillc. MD): Becton Dickinson (Becton Dickinson 
Lahwarc, Lincoln Park. N.I); BioRad (BioRad. Richmond. CA): Biotech (C-C Biotech Corp.. 
Poway. CA): Charles River (Charles River Laboratories. Wilmington. MA): CocaJico 
(Cocalico -Biologicals Inc.. Kcamstown. PA): CytRx (CytRx Corp.. Norcross. (iA): Falcon 
<c.». Baxter Healthcare C orp.. McGaw Park. IL and Becton Dickinson): I DA (federal Food 
and Drug Administration): Fisher Biotech (Fisher Biotech. Springfield. N.I): GJBCO (Grand 
Island Biologic Company/URL. Grand Island. NY): Gibco-BRL (Life Technologies. Inc.. 
Gailhersburg. MD): Harlan Sprague Dawley (Harlan Sprague Dawley. Inc.. Madison. Wl): 
Mallinckrodt (a division of Baxter Healthcare Corp.. McCiaw Park. IL): Milliporc (Millipore 
Corp.. Marlborough. MA): New England Biolabs (New England Biolabs. Inc.. Beverly. MA); 
Novagcn (Novagen. Inc.. Madison. Wl); Pharmacia (Pharmacia. Inc.. Piscataway. N.1); Oiagen 
(Oiagen. Chatsworth. CA): Sasco (Sasco. Omaha. NE): Showdex (Shown Denko America. 
Inc.. New York. NY): Sigma (Sigma Chemical Co.. St. Louis. MO): Sterogene (Sterogene. 
Inc.. Arcadia. CA); Tech Lab (Tech Lab. Inc.. Blacksburg. VA): and Vaxcell (Vaxcell. Inc.. 
a subsidiary ofCytRX Corp.. Norcross. CiA). 

When a recombinant protein is described in the specification it is referred to in a 
short-hand manner by the amino acids in the toxin sequence present in the recombinant 
protein rounded to the nearest 10. Fur example, the recombinant protein pMB 1850-2360 
contains amino acids 1852 through 2362 of the C. Jiffhih toxin B protein. The specification 
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gives detailed construction details for all recombinant proteins such that one skilled in the art 
will know precisely which amino acids arc present in a giver, recombinant protein. 

EXAMPLE I 

Production OHIigh-Titcr Antibodies To Clostridium difficile Organisms In A Hen 

Antibodies to certain pathogenic organisms have been shown to be effecttvc in trea.inu 
diseases caused by those organisms. It has not been shown whether antibodies can be raised' 
against Clostridium difficile, which would be effective i„ treating infection by this organism. 
Accordingly. ( ; difficile was tested as immunogen for production of hen antibodies.' 

To determine the best course lor raising high-liter egg antibodies against whole C. 
difficile organisms, different immunizing strains and different immunizing concentrations were 
examined. The example involved (a) preparation of the bacterial immunogen. 
(H, immunization, (c) publication of anti-bacterial chicken antibodies, amUd) detection of 
anti-bacterial antibodies in the purified IgY preparations. 

a) Preparation Of Bacterial Immunogen 

( : difficile strains 4.1594 (serogroup A) and 43596 (serogroup C) were originally 
obtained from the A I CC*. These two strains were selected because .hey represent two of the 
most commonly-occurring serogroups isolated from patients with antibiotic-associated 
pseudomembranous colitis. |l)elmee ei al.. J. Clin. Microbiol.. 28< l()):221(> ( I90(».| 
Additionally, both of these strains have been previously characterized with respect to their 
virulence in the Syrian hamster model for C. difficile infection. |Delmec a al.. .1. Med 
Microbiol.. 33:85 f 1990). | 

The bacterial strains were separately cultured on brain heart infusion agar for 48 hours 
at 37°C in a Gas Pack 100 Jar <BBL. Cockcysville. MD) equipped with a Gas Pack Plus 
anaerobic envelope (BUI.,, forty-eight hour cultures were used because they produce better 
growth and the organisms have been found to be more cross-reactive with respect to their 
surface antigen presentation. The greater the degree ol cross-reactivity of our IgY 
preparations, the better the probability of a broad range of activity against different 
strains/serogroups. |Toma el al.. .1. Clin. Microbiol.. 26{3):426 ( I98X).| 

The resulting organisms were removed from the agar surface using a sterile dacron-tip 
swab, and were suspended in a solution containing 0.4% formaldehyde in PBS. pi I 7.2. This 
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concentration of formaldehyde has been reported as producing good results for the purpose of 
preparing whole-organism immunogen suspensions for the generation of polyclonal anti-C 
difficile antisera in rabbits. fDelmee el at.. J. Clin. Microbiol.. 21:323 (1985): Davies at at.. 
Microbial Path.. 9:141 (1990).] In this manner, two separate bacterial suspensions were 
prepared, one for each strain. The two suspensions were then incubated at 4°C for I hour, 
hollowing this period of formalin-treatment, the suspensions were centrifuged at 4.200 x g for 
20 min.. and the resulting pellets were washed twice in normal saline. The washed pellets, 
which contained formalin-treated whole organisms, were resuspended in fresh normal saline 
such that the visual turbidity of each suspension corresponded to a #7 McFarland standard. 
[M.A.C. Fdclstcin. "Processing C linical Specimens for Anaerobic Bacteria: Isolation and 
Identification Procedures /VS.M. Finegold et at (eds.).. Bailey and Scon's Diagnostic 
^ Microbiology, pp. 477-507. C.V. Mosby Co.. (1990). The preparation of McFarland 
ncphelometer standards and the corresponding approximate number of organisms for each 
tube arc described in detail at pp. 172-173 of this volume. | Kach of the two HI suspensions 
was then split into two separate volumes. One volume of each suspension was votumetricaily 
adjusted, by the. addition of saline, to correspond to the visual turbidity of a #! McFarland 
standard. \ld.\ The it] suspensions contained approximately 3 x 10 s organisms/mL and the 
£7 suspensions contained approximately 2 x 10 w organisms/ml. | Id.] The four resulting 
concentration-adjusted suspensions of formalin-treated C difficile organisms were considered 
to be "bacterial immunogen suspensions." These suspensions were used immediately after 
preparation for the initial immunization. [See section (b).| 

The Tbrmalin-ircatmcnt procedure did not result in 100% non-viable bacteria in the 
immunogen suspensions. In order to increase the level of killing, the formalin concentration 
and length of treatment were both increased for subsequent immunogen preparations, as 
described below in Table 3. (Although viability was decreased with the stronger formalin 
treatment. 100% inviability of the bacterial immunogen suspensions was not reached.) Also, 
m subsequent immunogen preparations, the formalin solutions were prepared in normal saline 
instead of PBS'. At day 49. the day of the fifth immunization, the excess volumes of the four 
previous bacterial immunogen suspensions were stored fro/en at -70°C for use during all 
subsequent immunizations. 
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t>) Immunization 

For the initial immunization. 1.0 ml volumes of each of the four bacterial immunogen 
-suspensions described above were separately emulsified in 1.2 ml volumes of CFA (GIBCO). 
For each of the four emulsified immunogen suspensions, two four-month old White Leuhorn 
hens (pre-Jaying) were immunized. (It is not necessary to use pre-laying hens: actively-laying 
hens can also be utilized.) Each hen received a total volume of approximately 1.0 ml of a 
single emulsified immunogen suspension via four injections (two subcutaneous and two 
intramuscular) of approximately 250 ul per site. In this manner, a total of four different 
immunization combinations, using two hens per combination, were initiated for the purpose of 
evaluating both the effect of immunizing concentration on egg yolk antibody (IgY) 
production, and iiUerslrain cross-reactivity of IgY raised against heterologous strains. The 
four immunization groups are summarized in Table 3. 





TABLE 3 

Immunization Groups 


C» roup Designation 






CD 45594. c| 


C difficile 
strain 43594 


1.5 * MP oriian isms -hen 


CO 43594. - 7 




1.0* H> organisms hen 


CD 43596. *| 


C difficile 
strain 43596 


1.5 v 10 s organisms hen 


CD 45596. *-'7 




1.0 * 10' oriianistns'hcn 



20 



I he lime point for the first series of immunisations was designated as "day zero. M All 
subsequent immunizations were performed as described above except that the bacterial 
-immunogen suspensions were emulsified using I f A (GIBCO) instead of C-'f/V and for the 
later time point immunization, the stored frozen suspensions were used instead of freshly- 
prepared suspensions. The immunization schedule used is listed in fable 4. 
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TABLE 4 

Immunization Schedule 



Day Of Immunization 


Formalin-Treatment 


I film tin Of» ii PmntririAn 1 \raA 


0 


1%. 1 hr. 


freshly- prepared 


14 


1%: overnight 




21 


1%. overnight 




35 


l°v, 48 hrs. 




49 


I°b. 72 hrs. 




70 




stored frozen 


s> 






105 







c) Purification Of Anti-Kactcrial Chicken Antibodies 

Groups of four eggs were collected per immunization group between days 80 and K4 
post-initial immunizaiion. and chicken immunoglobulin (IgY) was extracted according to a 
modilleation of the procedure of A. Poison et aL, Immunol. Comm.. 9:495 (1980). A trenile 
stream of distilled water from a squirt bottle was used to separate the yolks from the whites, 
and the yolks were broken by dropping them through a funnel into a graduated cylinder. The 
lour individual yolks were pooled for each group. The pooled, broken yolks were blended 
with 4 volumes of egg extraction buffer to improve antibody yield (egg extraction buffer is 
0.01 M sodium phosphate. 0.1 M NaCI, pH 7/5. containing 0.005% thimerosal). and PlXi 
S000 (Amresco) was added to a concentration of 3.5%. When all the PEG dissolved.- the 
protein precipitates that, formed were pelleted by cenlrifugation at 13.000 x g for 10 minutes. 
I he supernatants were decanted and filtered through cheesecloth to remove the lipid layer, 
and the PlXi was added to the supernatants to a final concentration of 12% (the supernatants 
were assumed to contain 3.5% PEG). After a second ccntrifugaiion. the supernatants were 
discarded and the pellets were centrifuged a final lime lo extrude the remaining PlXi. These 
crude IgY pellets were then dissolved in the original yolk volume of egg extraction buffer and 
stored at 4°C\ As an additional control, a preimmune IgY solution was prepared as described 
above, using eygs collected from unimmunized hens. 
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d) Detection Of Anti-Bacterial Antibodies In The Purified | K Y 
Preparations 

In order to evaluate the relative levels of specific am* *. difficile activity in the Ig Y 
preparations described above, a modified version of the whole-organism ,ELISA procedure of 
N.V. Padhyc e, ui. 1. Clin. Microbiol. 29:99,103 (,990, was used. Frozen organisms of both 
< difficile strains described above were thawed and diluted to a concentration of 
approximately I x 10' organisms/ml using PBS. P H 7.2. In this way. two separate coatinu 
suspensions were prepared, one for each immunizing strain. Into the wells of %-well 
microliter plates ., Falcon. Pro-Bind Assay Plates, were placed . 00 M l volumes of the coating 
-suspensions. In this manner, each plate well received a total of approximately I * 10" 
organisms of one stra.n or the other. The plates were then incubated at 4"C overnight. The 
next morning, the coating suspensions were decanted, and all wells were washed three times 
using PBS. |„ order to block non-specific binding sixes. 100 ul of 6.5% BSA (Sigma/in PBS 
was then added ,„ each well, and the plates were incubated Tor 2 hours a, room temperature. 
I he blocking solution was decanted, and 100 ul volumes of the IgY preparations described 
above were initially diluted 1.500 with a solution of 0.1% BSA in PBS. and then serially 
diluted in 1:5 steps. The following dilutions were placed in the wells: 1:500. 1:2 500 
1:62.5000. 1,1,2.500. and 1:1.562.500. The p.ates were .gain incubated for 2 hours a, room 
temperature, following this incubation, the IgY-containing solutions were decanted, and the 
wells were washed three times using BBS-Twecn fb/l M bone acid. 0.025 M sodium borate. 
1.0 M NaCI. 0.1% Tween-20). followed by two washes using PBS-Twecn (0.1% Tween-20). 
and linally. ,wo washes using PBS only. To each well. 100 M l of a 1:750 dilution of rabbi, 
anti-chicken lg<i (wholc-moleculet-alkalinc phosphatase conjugate (Sigma) (diluted in 0.1% 
BSA in PBS) was added. The plates were again incubated for 2 hours at room temperature. 
The conjugate solutions were decanted and the plates were washed as described above, 
substituting 50 niM NaXO,. pll 9.5 for the PBS in the final wash. The plates were 
developed by the addition of 100 M l of a solution containing 1 m g/ml para-n.trophenyl 
phosphate (Sigma) dissolved in 50 niM Na.CX),. 10 „,M MuC I,. pll 9.5 ,o each welL and 
incubating the plates at room temperature in the dark for 45 minutes. The absorbance of each 
well was measured at 410 nm using a Dynatcch MR -700 plate reader. In this manner, each of 
the lour IgY preparations described above was tested for reactivity against both of the 
immunizing ( •. difficile strains: strain-specific, as well as cross-reactive activity was 
determined. 
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Table 5 shows the results of the whole-organism ELTSA All four IgY preparations 
demonstrated significant levels of activity, to a dilution of 1:62.500 or greater against both of 
the immunizing organism strains. Therefore, antibodies raised against one strain were highly 
cross-reactive with the other strain, and vice versa. The immunizing concentration of 
organisms did not have a significant effect on organism-specific IgY production, as both 
concentrations produced approximately equivalent responses. Therefore, the lower 
immunizing concentration of approximately 1 .5 x 1 0* organisms/hen is the preferred 
immunizing concentration of the two tested. The prcimmunc IgY preparation appeared to 
possess relatively low levels of ( <///// t //c-reactive activity to a dilution of 1 :500. probably 
due to prior exposure of the animals to environmental Clostridia. 

An initial whole-organism F.LISA was performed using IgY preparations made from 
single C D 43504. ti\ and CD 43596. #1 eggs collected around day 50 (data not show.,,. 
Specific titers were found to be 5 in 10-fold lower than those reported in Table 5. These 
results demonstrate that .t is possible to begin immunizing hens prior to the time that they 
begin to lay eggs, and to obtain high titer specific IgY from the first eggs that are laid. In 
other words, i, is not necessary to wait for the hens to begin laying before the immunization 
schedule is started.-' 
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TABLE 5 



!() 



I O ir ■ Prpnirai inn 

■ * 1 reparation 


Dilution Of IgY Prep 


43594-Coated Wells 


43596-Coated Wells 




1:500 


1.746 


1.801 




1 :2.500 


1 .092 


1.670 


C D 43594. b I ■ 


1 . 1 2. 300 


0.202 


0.812 




! .0.1.500 


0.136 


0. 1 79 




I.-mJ.MjO 


0.012 


0.080 




1. 1-562.500 


0.002 


O.020 




1:500 


I.7R0 


1. 771 




1:2.500 


1.025 


1.078 


CD 43594. s7 


1:12.500 


0.188 


0.382 




1 ;62.500 


0.052 


0.132 




1:3 12.500 


0.022 


0.043 




i - 1 ,562.500 


0.005 


0.024 




1:500 


1 526 


1.790 




1 :2.50O 


0.832 


! 177 


CD 43596. '1 


1:12.500 


0.247 


0.452 




1:62.500 


0.050 


0.242 




1:312.500 


0.010 


0.067 




1:1.562.500 


o.ooo 


0 036 




..1:500 


1.702 


1.505 




1:2.500 


0.706 


0.866 


CD 43596. ' 7 


1 : 1 2.500 


0.250 


0.282 




! 02.500 


0.039 


0078 




1:312.500 


0.002 


0.017 




1:1.562.500 


0.000 


0.010- 




1:500 


0.142 


0.309 




1:2.500 


0.032 


0.077 


I'rcimmunc laY 


1:12.500 


0.006 


0.024 




1:62.500 


0.002 


0.012 




1:312.500 


0.004 


0.010 




1: 1.562.500 


0.002 


0.014 



EXAMPLE 2 

Treatment Of t \ difficile Infection With Anti-C . difficile Antibodv 



15 



In order to determine whether the immune IgY antibodies raised against whole C. 
difficile organisms were capable of inhibiting the infection of hamsters by ( . difficile. 
hamsters infected by these bacteria were utilized. |l.yerly vt <//.. Infect. Immun.. 50:2215- 
2218 This example involved: (a) determination of the lethal dose ol ( difficile 

organisms: and (b) treatment ol infected animals with immune antibody or control antibody in 
nutritional solution. 
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a) Determination Of The Lethal Dose Of C. difficile Organisms 
Determination of the lethal dose of CV difficile organisms was carried out according to 
, .the- model described by D.M. Lyerly ei «/.. Infect. Immun.. 5.9:22 1 5-22 18 (1991). C. difficile 
strain ATCC 43596 (serogroup C. ATCC) was plated on BHI agar and grown anacrobically 
(BBl. Gas Pak 100 system) at 37°C lor 42 hours. Organisms were removed from the agar 
surface using a sterile dacron-tip swab and suspended in sterile 0.9% NaCI solution lo a 
density of 10" organisms/ml. 

In order to determine the lethal dose of ( /. difficile in the presence or control antibody 
and nutritional Ibrmula. non-immune eggs were obtained from unimmunized hens and a 12% 
PEG preparation made as described in Example 1(c). This preparation was redissolvcd in one 
fourth the original yolk volume of vanilla flavor Ensure*. 

Starting on day one. groups of female Golden Syrian hamsters (Harlan Sprague 
Daw ley). 8-9 weeks old and weighing V approximaiely 100 gm. were orall> administered I ml 
■of the preimmune/l-nsureK formula at time zero. 2 hours. 6 hours, and 10 hours. At 1 hour, 
animals were orally administered 3.0 mg clindamycin HC1 (Sigma) in I ml of water. This 
drug predisposes hamsters u> C. difficile infection by altering the normal intestinal flora. On 
day two. the animals were given 1 ml of the preimmune IgY/Ensure*,: formula at time zero. 2 
hours. 6 hours, and 10 hours. At I hour on day two. different groups of animals were 
inoculated orally with saline (control), or 10'. 10*. 10". or 10* C. difficile organisms in I ml 
of saline, f rom days 3-12. animals were given I ml of the preimmune IgY/Enstire * formula 
three lin.es daily and observed for the onset of diarrhea and death. Each animal was housed 
in an individual cage and was offered food and water ad libitum. 

Administration of 10" - 10* organisms resulted in death in 3-4 days while the lower 
doses of 10- - 1 0* organisms caused death in 5 days. Cecal swabs taken from dead animals 
indicated the presence of C '. difficile. Given the effectiveness of the 10' dose, this number of 
organisms was chosen for the following experiment to sec if hyperimmune ami-C. difficile 
antibody could blt>ck infection. 

b) Treatment Of Infected Animals With Immune Antibody Or 
Control Antibody In Nutritional Formula 

The experiment in (a) was repealed using three groups of seven hamsters each. Group 
A received no clindamycin or C. difficile and was the survival control. Group B received 
clindamycin. I0-* C. difficile organisms and preimmune IgY on the same schedule as the 
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animals in (a) above. Group C received clindamycin. U)- C. difficile organisms, and 
hyperimmune anti^C. difficile IgY on the same schedule as Group U. The anti-C difficile IgY 
was prepared as described in Example 1 except that the 12% PEG preparation was dissolved 
in one fourth the original yolk volume of Ensured. 

All animals were observed for the onset of diarrhea or other disease symptoms and 
death. Each animal was housed in an individual cage and was offered food and water ad 
lihiiitm. The results are shown in Table 6. 



TABLE 6 

The Effect Ol Oral Feeding Of Hyperimmune l«Y Antibodv on ( \ difficile Inferi i»n 


Animal Group 


Time To Diarrhea' 


Time To Death- 


A pre-immune IgY only 


no diarrhea 


no deaths 


H C lindamycin. (\ difficile, preimmune i»Y 


M) hrs 


4*> hrs. 


C C lindamycin, C* difficile, immune l»Y 


>.» hrs. 


56 hrs. 



Mean ol seven animals. 

I lamslers in the control group A did not develop diarrhea and remained healthy during 
the experimental period. Hamsters in groups B and C developed diarrheal disease. Anti-C 
difficile IgY did not protect the animals from diarrhea or death, all animals succumbed in the 
same lime interval as the animals treated with preimmune IgY. Thus, while immunization 
with whole organisms apparently can improve sub-lethal symptoms with particular bacteria 
(see U.S. Patent No. 5.080.895 to I I. Tokoro). such an approach does not prove to be 
productive to protect against the lethal effects of C. difficile. 

EXAMPLE 3 

Production of C. hotulimtm Type A Antitoxin in Hens 

In order to determine whether antibodies could be raised against the toxin produced by 
clostridial pathogens, which would be effective in treating clostridial diseases, antitoxin to ( 
hoiulinum type A toxin was produced. This example involves: (a) toxin modification: (b) 
immunization: (O antitoxin collection: (d) antigenicity assessment: and (e> assay of antitoxin 
liter. 
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a) Toxin Modification 

C hoiidinum type A toxoid was obtained from B. R. DasGupta; From this, the active 
type A neurotoxin (M.W. approximately 150 kD) was purified to greater than 99% purity, 
according to published methods. [B:R. DasGupta & V. Sathyamoorthy. Toxicon. 22:415 
(1984).] The neurotoxin was detoxified with formaldehyde according to published methods. 
|B.R. Singh & B.R. DasGupta. Toxicon. 27:403 (1989).) 

•>) Immunization 

( hoittlimtm toxoid lor immunization was dissolved in PBS ( I mg/ml) and was 
emulsified with an approximately equal volume of CPA (GIBCO) for initial immunization or 
II- A for booster immunization. On day zero, two white leghorn hens, obtained from local 
breeders, were each injected at multiple sites (intramuscular and subcutaneous) with 1 ml 
inactivated toxoid emulsified in I ml CFA. Subsequent booster immunizations were made 
according to the following schedule for day of injection and toxoid amount: days 14 and 21 - 
0.5 ,„g: day 171 - 0.75 mg: days 394. 401. 409 - 0.25 mg. One hen received an additional 
booster of 0. 1 50 mu on dav 544. 



«■')• Antitoxin Collection 

Total yolk immunoglobulin (IgY) was extracted as described in F.xamplc 1(c) and the 
20 |gY pellet was dissolved in the original yolk volume of PBS wiih thimerosal. 

d) Antigenicity Assessment 

Kggs were collected from day 409 through day 423 to assess whether the toxoid was 
sufficiently immunogenic to raise antibody. F.ggs from the two hens were pooled and 
antibody was collected as described in the standard .PEG protocol. |Tixample l(c).| 
Antigenicity of the botulinal toxin was assessed on Western blots. The 150 kD detoxified 
type A neurotoxin and unmodified, toxic. 300 kD botulinal type A complex (toxin used for 
intragastric route administration lor animal gut neutralization experiments; see Example 6) 
uere separated on a SDS-polyacr> lamide reducing gel. The Western blot technique was 
performed according to the method of Tovvbin. [II. Towbin ci a/.. I'roc. Natl. Acad. Sci. 
l/SA. 76:4350 (1 979). | Ten ug samples of C hniulinum complex and toxoid were dissolved 
in SDS reducing sample buffer (1% SDS. 0.5% 2-mercaptoethanol. 50 mM I ris, pi I 6.8. 10% 
glycerol. 0.025% w/v bromphenol blue. 10% ()-mercaptoeihanol). heated at 95°C for 10 mm 
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and separated on a I mm . thick 5% SDS-poIyacn'!a.T,ide gel. [K. Weber and M 
OsMr^Proteins and Sodium Dodecyl Sulfate? Molecular Wei.Ht Determination on 
PolyacrytamidcXiels and Related Procedure," i„ The Proteins. 3d Edition ,H Neurath & 
R L. Mill. eds). pp. ,79-223. (Academic Press. NY. 1075)., Par, of the uel was cut off and 
the proteins were stained with Coomassie Blue. The proteins in ,he remainder of the eel were 
transterred ,o nitrocellulose using the Mil.ib.ot-SDE electro-blotting svs,enT<Mil,i pore , 
accordmg to manufacturer s directions. The nitrocellulose was temporarily Gained with .0% 
« onceuu S fS.B. Carroll and A. Laughon. "Product, and Punficauon of Polyclonal 
Annbodies ,o the Foreign Se^en, of0- K aluc,osidase Fusion Proicins." in DNA C,„nin V t 
Practical Approach. Vol.111. (D. Glover, ed.). pp. 89-.,.. IRL Press. Oxford ,,987), ,o ' 
v.suahzc the lanes, then destained by running a gentle stream of distilled water over the blot 
«or several mmutcs. The nitrocel.u.ose was immersed in PUS containing 3% BSA overniuht 
M 4 C to block any remaining protein binding sites. 

The bio, was cut into strips and each strip was incubated with the appropriate primarv 
unubod., The avian anti-r HotuUnum antibodies fdescribed in ,c,| and pre-.mmune chicken 
am.budy.ias control, were diluted 1:125 in PUS containing , mg/m, USA lor 2 hours a, room 
temperature. The blots were washed with two changes each of large volumes of PBS BBS- 
I ween and PBS. successively < , 0 min/wash). Goat anti-chicken IgG alkaline phosphatase 
conjugated secondary antibody (Fisher Biotech, w as di.uted 1:500 in PBS contain^ , „ lu/m , 
-0 BSA and incubated with the b.o, for 2 hours at room temperature. The blots were washed 

- „h two changes each of large volumes of PBS and BBS-Twecn. followed hv one chan.e of 
PBS and (,. , M Tris-I ICI. pH 9.5. Blots were developed in frcshh prepared alkaline " 
Phosphatase substrate buffer , ,00 M g/ m | nilrobiue lctrazomim (Sj „ m;i) 5() ^ 
chloro-3-.ndolyl phosphate (Sigma,. 5 mM MgCU in 50 mM Na,CO v pll 9.5). 
^ The Western blots are shown in f igure I. The anti-C h,u U ,im,m IgY reacted to the 

toxoid to give a broad immunoreactive band at about 145-150 kD on the reducin- .. c l This 
toxoid is refractive to disullide cleavage by reducing agents due to formalin cross.inking The 
immune IgY reacted with the active toxin complex, a 97 kD ( /„„„,„„„„ tvpe A beavv chain 
and a ,3 kD ligh, chain . Tne preimmune Ig Y was w ^ ( . ^ ^ 

toxoid in the Western blot. 



15 
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c) Antitoxin Antibody Titer 

The IgY antibody titer to C. hoiulimm type A toxoid of eggs harvested between day 
4(W and 423 was evaluated by ({LISA, prepared as follows. Nineiy-six-well Falcon Pro-bind 
plates were coated overnight at 4°C with 100 ul/well toxoid [B R. Singh & B.R. Das Gupta. 
Toxicon 27:403 fl98<»| at 2.5 ug/ml in PBS. pH 7.5 containing 0.005% thimerosal. The 
following day the wells were Mocked with PBS containing 1% BSA for I hour at 37°C. The 
IgY from immune or prcimmunc eggs was diluted in PBS containing 1% BSA and 0.05% 
Tween 20 and the plates were incubated for 1 hour at 37°C. The plates were washed three 
limes with PBS containing 0.05% Tween 20 and three times with PBS alone. Alkaline 
phosphatasc-conjugated goat-anti-chicken IgG (Fisher Biotech) was diluted 1:750 in PBS 
containing 1% BSA and 0.05% Tween 20. added to the plates, and incubated I hour at 37°C\ 
, The plates were washed as before, and p-niirophenyl phosphate (Sigma) at I mg/ml in 0.05 M 
Na,t*O v pi I Mo. 10 mM MgCI : was added. 

The results are shown in Figure 2. Chickens immunized with the toxoid generated 
high titers of antibody to the immunogen. Importantly, eggs from both immunized hens had 
signillcant anti-immunogen antibody liters as compared to prcimmunc control. eggs. The anli- 
C htmrihntm IgY possessed significant activity, to a dilution of 1:^3.750 or greater. 

EXAMPLE 4 

Preparation Of Avian ligg Yolk Immunoglobulin In An Orally Administrable Form 

In order to administer avian IgY antibodies orally to experimental mice, an effective 
delivery formula for the IgY had to be determined. The concern was that if the crude IgY 
was dissolved in PBS. the saline in PBS would dehydrate the mice, which might prove 
harmful over ihe duration of the study. Therefore, alternative methods of oral administration 
of IgY were tested. The example involved: (a) isola-lion of immune IgY: (h) solubilization 
of IgY in water or PBS. including subsequent dialysis of the IgY-PBS solution with water to 
eliminate or reduce the salts (salt and phosphate) in the buffer; and (c) comparison of the 
quantity and aelivity of recovered IgY by absorbance at 280 nm and PACil:. and enzyme- 
linked immunoassay (EL1SA). 
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a) Isolation Of Immune IgY 

In order to investigate the most effective delivery formula for IgY. we used IgY which 
was raised again* C,o,a,us Jurissus .ernficus venom Three eggs werie collected from hens 
•mmunized with the C. UuHssas ,cw///,,wenom and IgY was extracted from the v« lks usinu 
•he ,„od,hed Poison procedure described by Thalley and Carroll IRio/Technolouv. 8:914-918 
( I99())| as described in Example 1(c). 

The egg yolks were separated from the vvhi.es. pooled, and blended with four volumes 
of PBS. Powdered PEG 8000 was added to a concentration of 3.5%. The mixture w-,s 
centrifugal at .0.000 rpm for .0 minu.es to pellet the precipi.a.ed protein, and the 
supernatant was filtered through cheesecloth ,o remove the lipid lav*. Powdered PEG 8000 
was added to .he supernatant ,o bring ,he final PEG concentration to .2% (assuming a PEG 
co„centra„on of 3.5% i„ .he supernatant,. The 12% PEG/lgY mixture was divided into two 
equal volumes and centrifuged to pellet the IgY. 

I») Solubilization Of The lj»Y In Water Or PBS 

One pellet was rcsuspended in 1/2 the original yolk volume of PBS. and the other 
Pellet was rcsuspended i„ 1/2 .he original yolk volume of water. The pellets were then 
cenm.uged.to remove any particles or insoluble material. The .gY in PBS solution dissolved 
readily but the fraction rcsuspended in water remained cloudy. 

In order to satisfy anticipated sterility requirements for orally administered antibodies 
-he anuhodv solution needs to be Hltcr-sterilizcd (as an alternate c to heat steri.izauon which 
would destroy the antibodies). The preparation of IgY rcsuspended in water was too cloudv 
i« pass through cither a 0.2 or 0.45 urn membrane litter, so .(, m , „r the PBS rcsuspended' 
•raeuon was d.alyzed overnight a, room temperature againsl 250 m , of water. The follow™ 
mornrng the dialysis chamber uas emptied and refilled with 250 ml of fresh II. () for a second 
u.alysis. Thereafter, the yields of soluble antibody were determined a. OD,, and arc- 
compared in fable 7. 
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TABLE 7 



Fraction 


Absorbance Of 1:10 Dilution At 280 nm 


Percent Recover 


PBS dissolved 


1.149 




l! ; 0 dissolved 


0.706 


61% 


PBS dissolved H.O dialyzed 


0.885 


77% 



Resuspending the pellets in PBS followed by dialysis against water recovered more 
amibody than directly resuspending the pellets in water (77% versus 61%). liquivalcnt 
volumes of the Ig Y preparation in PBS or water were compared by PAGE, and these results 
were in accordance with the absorbance values (data not shown). 



c) Activity Of ! S Y Prepared With Different Solvents 

An hi. ISA was performed to compare the binding activity of the IgY extracted by 
each procedure described above. C. Juries terrifies «'.</./.> venom at 2'5 ng/ml in PUS 
was used to coa, each well of a <;6-well microliter plate. The remaining protein binding sites 
were blocked with PBS containing 5 mg/ml BSA. Primary antibody dilutions (in PBS 
containing I mg„nl BSA) were added in duplicate. After 2 hours of incubation at room 
temperature, the unbound primary antibodies were removed by washing the wells with PBS. 
BBS-Tween. and PBS. The species specific secondary antibody (goat anti-chicken 
immunoglobulin alkaline-phosphatase conjugate (Sigma) was diluted 1:750 in PBS containing 
I mg/ml BSA and added to each well of the microliter plate. After 2 hours of incubation at 
mom temperature, the unbound secondary antibody was removed by washing the plate as 
before, and freshly prepared alkaline phosphatase substrate (Sigma) ai I mg/ml in 50 mM 
Na,CO, 10 mM MgCK. pll 9.5 was added to eaci, well. The color development was 
measured on a Dynatech MR 700 microplate reader using a 412 nm filter. The results are 
shown in Table 8. 

The binding assay results parallel the recovery values in Table 7. with PBS -dissolved 
IgY showing slightly more activity than the PBS-dissolved/l 1,0 dialyzed amibody. The 
water-dissolved' antibody had considerably less binding activity than the other preparations! 
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EXAMPLE 5 

Survival Of Antibody Activity After Passage Through The Gastrointestinal Traet 

In order to determine the feasibility of oral administration of antibody, it was of 
interest to determine whether orally administered IgY survived passage through the 
gastrointestinal tract. The example involved: (a, oral administration of specific immune 
antibody mixed with a nutritional formula: and <h, assay of antibody activity extracted from 
feces. 

TABLE 8 



Dilution 


Preimmunc 


PBS Dissolved 


H : 0 Dissolved 


PBS/H,Q 


1:500 


0005 


1.748 


1.577 


1.742 


1:2.500 


0.004 


0.644 


0.340 


0.606 


1:12.500 


0.001 


0.144 


0054 


tuwo 


1:62.500 


0.001 


0.025 


■0.007 


0 016 


1:312.500 


0.010 


0.000 


0.000 


0.002 



a) Oral Administration Of Antibody 

The IgY preparations used in this example are the same PHS-dissolved/l KO dialyzcd 
amivenom materials obtained in Kxamplc 4 above. mixed with an equal volume of Hnfamil*. 
Two mice were used in this experiment, each receiving a dilTereni diet as follows: 

! ) water and food as usual: 

:> immune IgY preparation dialyzcd against water and mixed 1:1 will, luifamikR. 
CThe mice were given the corresponding mixture as their only source of food and water). 

•>) Antibody Activity After Ingestion 

After both mice had ingested their respective fluids, each tube was refilled with 
approximately 10 ml of the appropriate fluid first thing in the morning. » y mid-morning 
there was about 4 to 5 ml of liquid left in each tube. At this point stool samples were 
collected from each mouse, weighed, and dissolved in approximately 500 M | PBS per 100 mg 
stool sample. One hundred and sixty mg of control stools (no antibody) and W mg of 
experimental stools (specific antibody) in 1.5 ml microfuge tubes were dissolved in 800 and 
500 ul PBS. respectively. The samples were heated at 37°C for 10 minutes and vortcxed 
vigorously. The experimental stools were also broken up with a narrow spatula. Each sample 
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was centrifueed for 5 minutes in a microfuge and the supernatants. presumably containing the 
-antibody extracts, were collected. The pellets were saved at 2-8°C in case future extracts 
were needed. Because the supernatants were tinted, they were diluted five-fold in PBS 
containing I mg/ml BSA for the initial dilution in the enzyme immunoassay (F.LISA). The 
primary extracts were then diluted live-fold- serially from this initial dilution. The volume of 
primary extract added to each well was 190 jiL The ELISA was performed exactly as 
described in Example 4. 



TABLE 9 



10 



Dilution 


Preimmune IgY 


Control Fecal Extract 


liXP. Fecal fc-xtract 


1:5 


0 


0.0(H) 


0.032 


1:25 


0.016 


- 0 


0.016 


1:125 


0 


0 


0.009 


1:625 


0 


* 0.00 .1 


0.00 1 


1:3 125- 


<> 


<) 


0.000 



15 



20 



I here v\as some active antibody in the fecal extract from the mouse given the specific 
antibody in [{nfamii* formula, but it was present at a very low level. Since the samples were 
assayed at an initial 1 :5 dilution, the binding observed could have been higher with less dilute 
samples. '■Consequently, the mice were allowed to continue ingesting either regular food and 
water or the specific IgY in Enfamil* formula, as appropriate, so the assay could be repeated. 
Another 1:1. ISA plate was coated overnight with 5 ^tg/ml of C.tU. venom in PBS. 

The following morning the HLISA plate was blocked with 5 my/ml BSA. and the focal 
samples were extracted as before, except that instead of heating the extracts at 37°C. the 
samples were kept on ice to limit proteolysis. The samples were assayed undiluted initially, 
and in 5X serial dilutions thereafter. Otherwise the assay was carried out as before. 
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TABLE 10 



Specifi 


c Antibody Survives Passage Through The Gastroin ies, in., tv... 


Dilution 
undiluted 


Preimmune IgY 
0.003 


Control Extract 
0 


Exp. Extract 

0.379 


1:5 


-0 


• 0 


0.071 


1:25 


0.000 


0 


0.027 


1:125 


0.003 


0 


0.017 


1:625 


0.000 


• 0 


0.008 


1:3125 


0.002 


0 


0002 



The experiment confirmed the. previous results, with the anlibodv activitv markcdlv 
higher. The control fecal extract showed no anti r./., activity, even undiluted." while the 
local extract from the anti-< .</./. l B Y/Enlhnii|«.fcd mou.se showed considerable a„ti- ( t // 
activtv This experiment (and the previous experiment, clearly demonstrate that active |»Y 
annbody surv ives passage through the mouse digestiv e tract, a 'finding with favorable 
implications tor the success of IgY antibodies administered orally as a therapeutic or 
prophylactic. 



EXAMPLE 6 

In l ivn Neutralization Of Type C. hoiiilmum 
Type A Neurotoxin By Avian Antitoxin Antibody , 

This example demonstrated the ability of HXi-purilled antitoxin, collected as 
described in Example 3. to neutralize the lethal effect off. hnmlinuni neurotoxin tvpe A in 
mice. To determine the oral lethal dose <1.D 1U0 , of toxin A. groups of BALB/c mice were 
given different doses of toxin per unit body weight (average body weight of 24 urams > 1 or 
oral administration, toxin A complex, which contains the neurotoxin associated with other 
non-toxin proteins was used. This complex is markedly more toxic than purified neurotoxin 
when given by the oral route. |l. Ohishi ft «/.. Infect. Immun.. 16. 106 , 1077).) ( /„„„//,„„„ 
toxin type A complex, obtained front l-ric Johnson (University Of Wisconsin. Madison, was 
250 ug/ml in 50 mfvl sodium citrate, pi I 5.5. specific toxici.y 3 * |rj' ,„ouse LD^nn with 
parenteral administration. Approximately 40-50 ng/gm body weight was usually latal\vi,hin 
4K hours in mice maintained on conventional food and water. When mice were uiven a diet 
adllhiium of only EnfamilQO the concentration needed to produce lethality was approximately 

- 58 - 



WO 98/08540 



PCT/US97/15394 



10 



15 



2.5 limes higher (125 ng/gm body weight). Botulinal loxin concentrations of approximately 
200 ng/gm body weight were fatal in mice fed Enfamil® containing preimmune IgY 
(resuspcnded in Enf'amil® at the original yolk volume): 

The oral LD, IM1 of C ■'. hotulinum toxin was also determined in mice that received 
known amounts of a mixture of preimmune IgY-Ensure«) delivered orally through fecdinc 
needles. Using a 22 gauge feeding needle, mice were given 250 ul each of a preimmune 
lgY-Ensure« mixture (preimmune IgY dissolved in 1/4 original yolk volume) I hour before 
and 1/2 hour and 5 hours after administering botulinal toxin, l oxin concentrations given 
orally ranged from approximately 12 to 312 ng/gm body weight (0.3 to 7.5 ug per mouse). 
Hotuiinal toxin complex concentration of approximately 40 ng/gm body weight (1 ui: per 
mouse) was lethal in all mice in jess than 36 hours. 

Two groups of BALB/c mice. 10 per group, were each given orally a single dose of 1 
tig each of botulinal toxin complex in 100 ul of 50 mM sodium citrate P H 5.5. The mice 
received 250 ul treatments of a mixture of either preimmune or immune IgY in Ensure*; (1/4 
original > oik volume) I hour before and 1/2 hour. 4 hours, and 8 hours after botulinal toxin 
administration. The mice received three treatments per day lor two more days. The mice 
were observed for 96 hours. The survival and mortality are shown in (able II. 

table: ii 



Toxin Dose ng-um 


Ami body Type 


Number Of Mice Alive 


Number Of Mice Dead 


4 1 .ft 


mm- immune 


0 


10 


4 1 .ft 


iinti-bouilmal toxin 


10 


(i 



All mice treated with the preimmune lgY-F.nsure« mixture died within 46 hours post- 
toxin administration. The average time of death in the mice was 32 hours post toxin 
administration. .Treatments of preimmune IgY-Ensure* mixture did not continue beyond 24 
hours due to extensive paralysis of the mouth in mice of this group. In contrast, all ten mice 
treated with the immune anti-botulinal toxin IgY-Ensureue mixture survived past 96 hours. 
Only 4 mice in this group exhibited symptoms of botulism toxicity (two mice about 2 days 
alter and two mice 4 days after toxin administration). These mice eventually died 5 and 6 
days later. Six of the mice in this immune group displayed no adverse effects to the toxin 
and remained alive and healthy long term. Thus, the avian anti-botulinal toxin antibody 
demonstrated very good protection from the lethal effects of the toxin in the experimental 
mice. 
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EXAMPLE 7 

Production Of An Avian Antiloxin Against Clostridium difficile Toxin A 

loxin A is a potent cytotoxin secreted by pathogenic strains of ( \ difficile, that plays a 
direct role in damaging gastrointestinal tissues. In more severe cases of C. difficile 
intoxication, pseudomembranous colitis can develop which may be fatal. This would be 
prevented by neutralizing the effects of this toxin in the gastrointestinal tract. As a first step, 
antibodies were produced against a portion of the toxin. The example involved: (a) 
conjugation of a synthetic peptide of toxin A to bovine serum albumin: (b) immunization of 
hens with the pcptide-BSA conjugate: and (c) detection of antitoxin peptide antibodies by 
LT.ISA. 

a) Conjugation Of A Synthetic Peptide Of Toxin A To Bovine 
Serum Albumin 

The synthetic peptide CQTIDGKKYYFN-NI I, (SliQ ID NO:82) was prepared 
commercial^ (Multiple Peptide Systems. San Diego. CA) and validated lo be .-80% pure by 
high-pressure liquid chromatography. The eleven amino acids following the cysteine residue 
represent a consensus sequence of a repeated amino acid sequence found in loxin A. (Wren 
ci al.. Infect. I mmun.. 59:3151-3155 (I991).| The cysteine was added to facilitate 
conjugation to carrier protein. 

I iv order lo prepare the carrier for conjugation. 13SA (Sigma) was dissolved in 0.01 M 
NaP0 4 . pi I 7.0 to a final concentration of 20 mg/ml and n-maleimidohenzoyl-N- 
hydroxy.succininiide ester (MBS. Pierce) was dissolved in N.N-dimeihyl foimamide lo a 
concentration of 5 mg/ml. MBS solution. 0.51 ml. was added to 3.25 ml of the BSA solution 
and incubated for 30 minutes at room temperature with stirring every 5 minutes. The MBS- 
activatcd BSA was then purified by chromatography on a Bio-Gel P-10 column (Bio-Rad: 40 
ml bed volumo equilibrated with 50 mM NaPC.) 4 . pll 7.0 buffer. Peak fractions were pooled 
(6.0 ml). 

l.yophilized toxin A peptide (20 mg) was added to the activated BSA mixture, stirred 
until the peptide dissolved and incubated 3 hours al room temperature. Within 20 minutes., 
the reaction mixture became cloudy and precipitates formed. After 3 hours, the reaction 
mixture was centrifuged at 10.000 x g for 10 min and the supernatant analyzed for protein 
content. No significant protein could be detected at 280 nm. The conjugate precipitate was 
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washed three times with PBS and stored at 4«C A second conjugation was performed with 
15 mg of activated BSA and 5 mg of peptide and the conjugates pooled and suspended at a 
peptide concentration of 10 mg/ml in 10 itjM NaP0 4 . pH 7:2. i 

l») Immunization Of Hens With Peptide Conjugate 

Two hens were each initially immunized on day zero by injection into two 
subcutaneous and two intramuscular sites with I mg of peptide conjugate that was emulsified 
in CFA (GIBCO). The liens were boosted on day 14 and day 21 with I mg of peptide 
conjugate emulsiiled in I FA (GIBCO). 

e) Detection Of Antitoxin Peptide Antibodies By ELISA 

lg Y was purified from two eggs obtained before immunization ( pre-immune) and two 
eggs obtained 31 and 32 days after the initial immunization using PEG fractionation as 
described in lixamplc 1. 

Wells of a 96-well microliter plate (Falcon Pro-Bind Assay Plate) were coated 
overnight at 4°C with 100 ug/ml solution of the toxin A synthetic peptide in PBS. pH 7.2 
prepared by dissolving I mg of the peptide in 1.0 ml of H.O and dilution of PBS. The pre- 
immune and immune IgY preparations were diluted in a live-fold" scries- in a buffer containing 
1% PliG 8000 and 0.1% Twccn-20 (v/v) in PBS. pH 7.2. The wells were blocked for 2 
hours at room temperature with 1 50 M l of a solution containing 5% (v/v) Carnation* nonfat 
dry milk and 1% PEG 8000 in PBS. pi! 7.2. After incubation for 2 hours at room 
temperature, the wells were washed, secondary rabbit anti-chicken IgG-alkaline phosphatase 
(1750) added, the wells washed again and' the color development obtained as described in 
Fxample 1 The results are shown in Table 12. 

TABLE 12 



Dilution Of I'iiG Prep 


Absorbance Ai 410 nm 


Preimmune 


Immune Anti-Peptide 


1:100 


0.0 13 


0.253 


1:500 


0.004 


0.O39 


1:2500 


0.004 


O.005 



Clearly, the immune antibodies contain titers against this repeated epitope of tox 



toxin A. 
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EXAMPLE 8 

Production Of Avian Antitoxins Against 
, Clostridium difficile Native Toxins A And B 

To determine whether avian antibodies are effective for the neutralization of C 
difficile toxins, hens were immunized using native C. difficile toxins A and B. The resulting 
egg yolk antibodies were then extracted and assessed for their ability to neutralize toxins A 
and B i„ vino. The Example involved (a) preparation of the toxin immunogens. (b) 
immunization, (c, purification of the antitoxins, and <d> assay of toxin neutralization activity. 

a) Preparation Of The Toxin Immunogens 

Both C difficile native toxins A and B. and C. difficile toxoids, prepared by the 
treatment of the native toxins w ith formaldehyde, were employed as immunogens. C. difficile 
toxoids A and B were prepared by a procedure which was modillcd from published methods 
(Enrich ci al.. Infect. Immun. 28:1041 (1980). Separate solutions (in PBS) of native C 
difficile toxin A and toxin B (Tech Lab) were each adjusted to a concentration of 0.20 mg/ml. 
and formaldehyde was added to a final concentration of 0.4%. The toxin/formaldehyde 
solutions were then incubated at 37°C for 40 hrs. free formaldehyde was then removed from 
the resulting toxoid solutions by dialysis against PBS at 4°C. In previously published reports, 
this dialysis step was not -performed. Therefore, free formaldehyde must have been present in 
their toxoid preparations. The loxoid solutions were concentrated, using a Ccntriprep 
concentrator unit (Amicon). to a final toxoid concentration of 4.0 mg/ml. The two resultinii 
preparations were designated as toxoid A and toxoid B. 

C difficile native toxins were prepared ly concentrating stock solutions of toxin A and 
toxin 13 (lech Lab. Inc). using Ccntriprep concentrator units (Amicon). to a final 
concentration of 4.0 mg/ml. 

b) Immunization 

The first two immunizations were performed using the toxoid A and toxoid B 
immunogens described above. A total of 3 different immunization combinations were 
employed, l or the first immunization group. 0.2 ml of toxoid A was cmulsillcd in an equal 
volume of Tiler Max adjuvant (CytRx). l iter Max was used in order to conserve the amount 
of immunogen used, and to simplify the immunization procedure. This immunization group 
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was designated "CTA." For the second immunization group. 0.1 ml of toxoid B was 
emulsified in an equal volume of Titer Max adjuvant. This group was designated "CTB." 
For the third immunization group. 0.2 ml of toxoid A was first mixed with 0.2 ml of toxoid 
n. and the resulting mixture was emulsified in 0.4 ml of Titer Max adjuvant. This group was 
designated "CTAB." In this way. three separate immunogen emulsions were prepared, with 
each emulsion containing a final concentration of 2.0 mg/ml of toxoid A (CTA) or toxoid B 
(CTB) or a mixture of 2.0 mg/ml toxoid A and 2.0 mg/ml toxoid B (CTAB). 

On day 0. White Leghorn hens, obtained from a local breeder, were immunized as 
lollows: Group CTA. Four hens were immunized, with each hen receiving 200ug of toxoid 
A. via two intramuscular (I.M.) injections of 50ul of CTA emulsion in the breast area. 
Group CTB One hen was immunized with 200ug of toxoid B. via two I.M. injections of 
50 M I of CTB emulsion in the breast area. Group CTAB. l our hens were immunized, with 
each hen receiving a mixture containing 200ug of toxoid A and 200ug of toxoid B. via two 
I.M. injections of lOOul of CTAB emulsion in the breast area. The second immunization was 
performed 5 weeks later, on day 35. exactly as described for the first immunization above. 

In order to determine whether hens previously immunized with C. difficile toxoids 
could tolerate subsequent booster immunizations using native toxins, a single hen from group 
CTAB was immunized for a third time, this lime using a mixture of the native toxin A and 
native toxin B described in section (a) above (these toxins were not formaldehyde-treated, and 
were used in their active form). This was done in order to increase the amount (titer) and 
affinity ol specific antitoxin antibody produced by the hen over that achieved by immunizing 
with toxoids only. On day 62. 0.1 ml of a toxin mixture was prepared which contained 
2()()ug of native toxin A and 200ug of native toxin B. This toxin mixture was then 
emulsified. 'in 0.1 ml of Titer Max adjuvant. A single CTAB hen xvas then immunized with 
the resulting immunogen emulsion, via two I.M. injections of lOOul each, into the breast area. 
This hen was marked with a wing band, and observed for adverse effects for a period of 
approximately I week, after which lime the hen appeared to be in good health. 

Because the CTAB hen described above tolerated the booster immunization with native 
toxins A and B with no adverse effects, it was decided to boost the remaining hens with 
native toxin as well. On day 70. booster immunizations were performed as follows: Group 
CTA. A 0.2 ml volume of the 4 mg/ml native toxin A solution was emulsified in an equal 
volume of Titer Max adjuvant. Each of the 4 hens was then immunized with 200ug of native 
toxin A. as described for the toxoid A immunizations above. Group CTB. A 50ul volume 
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of the 4 mg/ml native toxin B solution was emulsified in an equal volume of Titer Max 
adjuvant. The hen was then immunized with 200ug of native toxin B. as described for the 
toxoid B immunizations above. Croup CTAB. A 0.15 ml volume of the 4 mg/ml native 
toxin A solution was first mixed with a 0.15 ml volume the 4 my/ml native toxin B solution 
The resulting toxin mixture was then emulsified in 0.3 ml of Tiler Max adjuvant. The 3 
remaining hens (the hen with the wing band was not immunized this time) were then 
immunized with 200ug of native toxin A and 200ug of native toxin B as described for the 
toxoid A . toxoid B immunizations (CTAB) above. On day 85. all hens received a second 
booster immunization using native toxins, done exactly as described for the first boost with 
native toxins above. 

All hens tolerated both booster immunizations with native toxins with no adverse 
effects. As previous literature references describe the use of formaldehyde-treated toxoids, 
this is apparently the f.rst time that any immunizations have been performed using native ( . 
difficile toxins. 

c) Purification Of Antitoxins 

l-ggs were collected from the hen in group CTB 10-12 days following the second 
immunization with toxoid (day 35 immunization described in section (b) above), and from the 
hens in groups CTA and CTAB 20-21 days following the second immunization with toxoid. 
To be used as a pre-immune (negative) control, eggs xverc also collected from unimmunized 
hens from the same Hock. F.gg yolk immunoglobulin (IgY) was extracted from the 4 groups 
of eggs as described in Example 1 (c). and the final IgY pellets were solubilizcd in the 
original yolk volume of PBS without thimerosal. Importantly, thimerosal was excluded 
because it would have been toxic to the CHO cells used in the toxin neutralization assays 
described in section (d) below. 

d) Assay Of Toxin Neutralization Activity 

The toxin neutralization activity of the IgY solutions prepared in section (c) above was 
determined using an assay system that was modified from published methods. |Rhrich el a/.. 
Infect, lmmun. 28: 1041-1043 (1992): and McCiec el at. Microb. Path. 12:333-341 ( I992).| 
As additional controls, affinity-purified goat anti-C difficile toxin A (Tech l ab) and affinity- 
purified goat anti-C. difficile toxin B (Tech Lab) were also assayed for toxin neutralization 
activity. 
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The IgY solutions and goat antibodies were serially diluted using F 12 medium 
(GJBCO) which was supplemented with 2% PCS (GlBCO)f this solution will be referred to as 
, "medium" lor the remainder of this Example). The resulting antibody solutions were then 
.nixed with a standardized concentration of either native ( difficile toxin A (Tech Lab), or 
native ( . difficile toxin B (Tech Lab), at the concentrations indicated below: Following 
incubation at 37-C I or 60 min.. lOOul volumes of the toxin - antibody mixtures were added 
io the wells of %-well microliter plates (Falcon Microtest III) which contained 2.5 * 10' 
C hinese Hamster Ovary (CHO) cells per well (the CHO cells were plated on the previous day 
to allow them to adhere to the plate wells). The- final concentration of toxin, or dilution of 
antibody indicated below refers to the Una! test concentration of each reagent present in the 
respective microtiter plate wells. Toxin reference wells were prepared which contained CHO 
.colls and toxin A or toxin B at the same concentration used for the toxin plus antibody 
mixtures (these vvells contained no antibody). Separate control wells were also prepared 
uhich contained CMC) cells and medium only. The assay plates were then incubated for 18- 
24 hrs. in a 37°C. humidified. 5% CO. incubator. On the following day. the remaining 
adherent (viable) cells in the plate wells were stained using 0.2% crystal violet < Mallinckrodt) 
dissolv ed in 2% ethanol. lor 10 min. Excess stain was then removed by rinsing with water, 
and the stained cells were solubilizeu by adding 1 00f.il of I % SDN (dissolved in water) to 
each well. The absorbance of each well was then measured at 570 hm. and the percent 
cytotoxicity of each test sample or mixture was calculated using the following formula: 



% CHO Celt Cytotoxicity = |1 - j Abs - Sample y lQQ 

Abs. Control 



Unlike previous reports which quantitatc results v isually by counting cell rounding by 
microscopy, this Example utilized spectrophotometry methods to quantitatc the C. difficile 
toxin bioassay. In order to determine the toxin A neutralizing activity of the CTA. CTAB. 
and pre-immune IgY preparations, as well as the affinity-purified goal antitoxin A control/ 
dilutions of these antibodies were reacted against a 0. lug/ml concentration of native toxin A 
(this is the approx. 50% cytotoxic dose of toxin A in this assay system). The results are 
shown in figure 3. 

Complete neutralization of toxin A occurred with the CTA IgY (antitoxin A. above) at 
dilutions of 1:80 and lower, while significant neutralization occurred out to the 1:320 dilution. 
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The CTAB IgY (antitoxin A + toxin B. above, demonstrated complete neutralisation at the 
1:320-1:160 and lower dilutions, and significant neutralization occurred out to the 1:1280 
dilution. The commercially available affinity-purified goat antitoxin A| did not completely 
neutralize toxin A at any of the dilutions tested, but demonstrated significant neutralization 
out to a dilution of 1:1.280. The preiinmune IgY did no. show any toxin A neutralizing 
activity at any of the concentrations tested. These results demonstrate that IgY purified from 
eggs laid by h ens immunized with toxin A alone, or simultaneously with toxin A and toxin B 
is an effective toxin A antitoxin. 

The toxin B neutralizing activity of the CTAB and pre-immunc IgY preparations, and 
also the affinity-purified goat antitoxin B control was determined by reacting dilutions of 
these antibodies against a concentration of native toxin B of 0.1 ng/ml (approximately the 
50% cytotoxic dose of toxin B in the assay system). The results arc shown in figure 4. 

C omplete neutralization of toxin B occurred with the CTAB IgY (antitoxin A * toxin 
B. above, at the 1.40 and lower dilutions, and significant neutralization occurred out to the 
1 :32() dilution. The affinity-purified goat antitoxin B demonstrated complete neutralization a. 
dilutions of 1:640 and lower, and significant neutralization occurred out to a dilution of 
1:2.560. The preimmunc IgY did not show any toxin B neutralizing activity at anv of the 
concentrations tested. These results demonstrate that IgY purified from eggs laid by hens 
immunized simultaneously with toxin A and toxin B is an effective toxin B antitoxin. 

In a separate study, the toxin B neutralizing activity of CTB. CTAB. and pre-immunc 
IgY preparations was determined by reacting dilutions of these antibodies against a native 
toxin B concentration of 0.1 ng/ml (approximately 100% cytotoxic dose of toxin B ,n this 
assay system). The results are shown in Figure 5. 

Significant neutralization of toxin B occurred with the CTB IgY (antitoxin B. above) 
at dilutions of 1:80 and lower, while the CTAB IgY (antitoxin A . toxin B. above, was found 
to have significant neutralizing activity at dilutions of 1:40 and lower. The preimmunc IgY 
did not show any toxin B neutralizing activity at any of the concentrations tested. These 
results demonstrate that IgY purified from eggs laid by hens immunized with toxin B alone, 
or simultaneously with toxin A and loxin B. is an effective toxin B antitoxin. 
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EXAMPLE 9 

In vivo Protection Of Golden Syrian Hamsters From 
C. difficile Disease By Avian Antitoxins Against C '. difficile 'Toxins A And B 

The most extensively used animal model to study ( '. difficile disease is the hamster. 
ILycrly e, <//.. Infect, hnmun. 47:340-352 (1992).] Several other animal models for 
antibiotic-induced diarrhea exist, but none mimic the human form of the disease as closely as 
Hie hamster model, [R. Fekety. "Animal Model., of Amihio.ic-lnduced Coluis." in O. Zak and 
M. Sande (eds.). Experimental Models in Antimicrobial Chemodierapy. Vol. 2. p P :6l-72. 
( 1986). | U, this model, the animals are first predisposed to the disease by the oral 
administration of an antibiotic, such as clindamycin, which alters the population of normally- 
occurring gastrointestinal flora (Fekely. at 61-72). Following the oral challenge of these 
animals with viable ( difficile organisms, the hamsters develop cecitis. and hemorrhage 
ulceration, aiid inflammation are evident in the intestinal mucosa. [I.yerly el al.. Infect. 
Immuri. 47:340-352 ( 1985). | The animals become lethargic, develop severe diarrhea, and a 
high percentage of them die from the disease. |Lyerly et al. Infect. Immun. 47:349-352 
(1985)1 This model is therefore ideally suited for the evaluation of therapeutic agents 
designed ftir the treatment or prophylaxis of C. difficile disease. 

The ability of the avian ( difficile antitoxins, described in lixample I above, to 
protect hamsters IhwC. difficile disease was evaluated using the Golden Syrian hamster 
model of ( \ difficile infection. The F.xample involved (a) preparation of the avian C. difficile 
antitoxins, (b) in vivo protection of hamsters from ( . difficile disease by treatment with avian 
antitoxins, and (c) long-term survival of treated hamsters. 

a) Preparation Of The Avian C. difficile Antitoxins 

Jiggs were collected from hens in groups CTA and CTAB described in Example 1 (b) 
above. To be used as a pre-immune (negative) control, eggs were also purchased from a local 
supermarket, ligg yolk immunoglobulin (IgY) was extracted from the 3 groups of eggs as 
described in lixample I (cj. and the final IgY pellets were solubihzod in one fourth the 
original yolk volume of Ensured nutritional formula. 
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b) In vivo Protection Of Hamsters Against C. difficile Disease By 
Treatment With Avian Antitoxins 

The avian C difficile antitoxins prepared in section (a) above were evaluated for their 
ability to protect hamsters from f. difficile disease using an animal model system which was 
modified from published procedures. (Fckciy. at 61-72: Borricllo at ul.. j. Med. Microbiol.. 
24:53-64 < 1987): Kim cl «/.. Infect, immun.. 55:2984-2992 ( 1987). Borriello el at.. J. Med. 
Microbiol.. 25:191-1% (1988): Dclmee and Avesani. J. Med. Microbiol,. 33J85-90 < 1990); 
and Lycrly c, at.. Infect. Immun.. 59:2215-2218 ( 1991 ).| l or the study, three separate 
experimental groups were used, with each group consisting of 7 female Golden Syrian 
hamsters (Charles River), approximately 10 weeks old and weighing approximately 100 gms. 
each. The three groups were designated "CTA." "CTAB" and Tre-immune." These 
designations corresponded to the ant.toxin preparations with which the animals in each group 
were treated. IZach animal was housed in an individual cage, and was offered food and water 
adlihUum through the entire length of the study. On day I. each animal was orally 
administered 1.0 ml of one of the three antitoxin preparations (prepared in section (a) above) 
at the following limcpoinis: 0 hrs.. 4 hrs.. and X hrs. On day 2. the day I treatment was 
repeated. On day 3. at the 0 hr. limepoini. each animal was again administered antitoxin, as 
described above. At 1 he. each animal was orally administered 3.0 mg of clindamycin-IICI 
(Sigma) in 1 ml of water. This treatment predisposed the animals to infection with C. 
difficile. As a control for possible endogenous C. difficile colonization, an additional animal 
from the same shipment (untreated) was also administered 3.0 mg of clindamycin-HCI in the 
same manner. This. clindamycin control animal was left untreated (and uninfected) for the 
remainder of the studs At the 4 hr. and 8 hr. timepo.nts. the animals were administered 
antitoxin as described above. On day 4. at the 0 hr. limepoini. each animal was again 
administered antitoxin as described above. At 1 hr.. each animal was orally challenged with 1 
ml of C. difficile inoculum, which contained approx. 100 C difficile strain 43596 organisms 
in sterile saline. C\ difficile strain 43596. which is a serogroup C strain, was chosen because 
it is representative of one of the most frequently-occurring serogroups isolated from patients 
with antibiotic-associated pseudomembranous colitis, [ftclmcc el al . J. Clin. Microbiol.. 
28:2210-2214 (1985).| In addition, this strain has been previously demonstrated to be virulent 
in the hamster model of infection. |Delmee and Avesani. .(. Med. Microbiol.. 33:85-90 
(1990). | At the 4 hr. and 8 hr. timepoints. the animals were administered antitoxin as 
described above. On days 5 through 13. the animals were adminisiercd antitoxin 3x per day 
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us described for day I above, and observed for the onset of diarrhea and death. On the 
morning of day 14. the final results of the study were tabulated. These results are shown in 
Table 13. 

Representative animals from those that died in the Pre .immune and CTA groups were 
nceropsied. Viable ( : difficile organisms were cultured from the ceca of these animals, and 
the gross pathology of the gastrointestinal tracts of these animals was consistent with that 
expected for ( . difficile disease (inflamed, distended, hemorrhagic cecum, tilled with watery 
diarrhea-like material). In addition, the clindamycin control animal remained healthy 
throughout the entire study period, therefore indicating that the hamsters used in the study had 
not previously been colonized with endogenous C difficile organisms prior to the start of the 
study, following the final antitoxin treatment on day 13. a single surviving animal from the 
CTA group, and also from the C I AH group, was sacrificed and nceropsied-.- No pathology 
was noted in either animal. 

TABLE 13 



: Treatment Group 


No. Animals Surviving 


No. Animals Dead 


I're-lmmune 


1 


6 


C TA (Antitoxin A only) 


5 


•* 


C'TAH (Antitoxin A - Antitoxin tt) 


7 


0 



Treatment of hamsters with orally-administered toxin A and toxin B antitoxin (group 
CTAli) successfully protected 7 out of 7 (100%) of the animals from < difficile disease, 
treatment of hamsters with orally-administered toxin A antitoxin (group CTA) protected 5 
but of 7 (71%) „f these animals from C. difficile disease. Treatment using pre-immune IgY 
was not protective against C. difficile disease, as only I out of 7 (14%) of these animals 
survived. These results demonstrate that the avian toxin A antitoxin and the avian toxin A 
toxin f) antitoxin effectively protected the hamsters from C. difficile disease. These results 
also suggest that although the neutralization of toxin A atone coolers some degree of 
protection against C. difficile disease, in order to achieve maximal protection, simultaneous 
antitoxin A and antitoxin B activity is necessary. 

c) Long-Term Survival Of Treated Hamsters 

It has been previously reported in the literature that hamsters treated with orally- 
administered bovine antitoxin IgC. concentrate are protected from ( difficile disease as loim 
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as the treatment is continued, but when the treatment is stopped, the animals develop diarrhea 
and subsequently die within 72 hrs. [Eyeriy a «/.. Infect. Immim.. 59(6):22 1 5-22 1 8 ( 1 99 i ). J 

In order to determine whether treatment of ( V difficile disease Using avian antitoxins 
promotes long-term survival following the discontinuation of treatment, the 4 surviving 
animals in group CTA. and the 6 surviving animals in group CI AB were observed for a 
period of 1 1 days (264 hrs.) following the discontinuation of antitoxin treatment described in 
section <b) above. All hamsters remained healthy through the entire post-treatment period. 
This result demonstrates that not only does treatment with avian antitoxin protect auainst the 
onset of C. difficile disease (/.,.. „ is effective as a prophylactic,, i, also promotes long-term 
survival beyond the treatment period, and thus provides a lasting cure. 

EXAMPLE 10 

In vivo Treatment Of Established C. difficile Infection In Golden 
Syrian Hamsters With Avian Antitoxins Against < \ difficile Toxins A And B 

The ability of the avian C difficile antitoxins, described in Example 8 above, to treat 
an established C difficile infection was evaluated using the Golden Syrian hamster model. 
The Example involved (a) preparation of the avian C difficile antitoxins, (b) /„ ,•„•„ treatment 
of hamsters with established C. difficile infection, and <e> histologic evaluation of cecal tissue. 

a) Preparation Of The Avian C. difficile Antitoxins 

Eggs were collected from hens in group CTA B described in Example X (b) above, 
which were immunized with < " difficile toxoids and native toxins A and B. Eggs purchased 
from a local supermarket were used as a pre-immune (negative) control. Egg yolk 
immunoglobulin (IgY) was extracted from the 2 groups of eggs as described in Example I 
(e). and the final IgY pellets were solubilizcd in one-fourth the original yolk volume of 
Ensure^ nutritional formula. 

h) In vivo Treatment Of Hamsters With Established C. difficile 
Infection 

I he av ian C. difficile antitoxins prepared in section (a I above were evaluated for the 
ability to treat established C. difficile infection in hamsters using an animal model system 
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which was modified from the procedure which was described for the hamster protection study 
in Example 8(b) above. 

For the study, four separate experimental groups were used, with each group consisting 
of 7 female Golden Syrian hamsters (Charles River), approx. 10 weeks old. weighing 
approximately 100 gms. each. Each animal was housed separately, and was offered food and 
water ad lihitum through the entire length of the study. 

On day I of the study, the animals in all four groups were each predisposed to C 
difficile infection by the oral administration of 3.0 mg of clindamycin-! ICI (Sigma) in I ml of 
water. 

On day 2. each animal in all four groups was orally challenged with I ml of 
C. difficile inoculum, which contained approximately 100 C. difficile strain 43596 organisms 
in sterile saline. C difficile strain 43596 was chosen because it is representative of one of the 
most frequently-occurrini! serogroups isolated from patients with antibiotic-associated 
pseudomembranous colitis. [Delmee et «/.. J. Clin. Microbiol.. 28:2210-2214 ( 1990). | In 
addition, as this was the same C. difficile strain used in all of the previous Examples above, it 
was auain used : in order to provide experimental continuity. 

On day 3 of the study (24 hrs. post-infection), treatment was started for two of the 
lour groups of animals. Each animal of one group was orally administered 1 .0 ml of the 
CTAIi IgY preparation (prepared in section (a) above) at the following timepoints: 0 hrs.. 4 
hrs.. and 8 hrs. The animals in this group were designated "CTAB-24." The animals in the 
second group were each orally administered 1.0 ml of the pre-immune IgY preparation (also 
prepared in section (a > above) at the same timepoints as for the (.TAB group! These animals 
were designated 'Trc-24." Nothing was done to the remaining two groups of animals on day 

2:> ()n da >" 4 - 4 * h,s - post-infection, the treatment described for day 3 above was repeated 

lor the CTAB-24 and Pre-24 groups, and was initiated for the remaining two groups at the 
same timepoints. The final two groups of animals were designated "CTAB-48" and 'Tre-48" 
respectively. 

On days 5 through 9. the animals in all four groups were administered antitoxin or 
pre-immune IgY. 3x per day. as described lor day 4 above. The four experimental groups are 
summarized in Table [4. 
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Group Designation 


' - Experimental Treatment 1 


CTAB-24 


Infected, treatment w/antitoxin | g Y started ^ 24 hrs. post-infection 


l're-24 


Infected, treatment w/pre-immune IgY started Uh 24 hrs. post-infection 


CTAD-48 


Intected. treatment w/antitoxin h?Y started (w 48 hrs. post-infection 


Pre-48 


infected, treatment w/pre-immune l»Y staned M 48 hrs. post-infection. 


All animals were observed for the onset of diarrhea and death through the conclusion 
»l the study on the morning of day 10. The results of this study are displayed in Table 15. 




TABLE 15 






nxperimental Outcome-Dav 10 


Treatment Group 


No. Animals Survivinu 


No. Animals Dead' 


C TAB-24 


6 


1 


Pre-24 


0 


7 


CTAB-48 


4 


,i 


Pre-48 


2 


5 



20 



23 



Eighty-six percent of the animals which began receiving treatment with antitoxin IgY 
at 24 hrs. post-infection (GTAB-24 above, survived, while 57% of ,he animals treated with 
antitoxin IgY starting 48 hrs. post-infection (CTAB-48 above, survived. In contrast, none of 
the animals receiving pre-immunc IgY starting 24 hrs. post-infection (Pre-24 above) survived, 
and only 29% of the animals which began receiving treatment with pre-immunc IgY at 4X 
hrs. post-infection (Pre-48 above) survived through the conclusion of the study. These results 
demonstrate that avian antitoxins raised against C. difficile toxins A and U are capable of 
successfully treating established C. difficile infections in vivo. 



30 



c) Histologic Lvaluation Of Cecal Tissue 

In order to further evaluate the ability of the IgY preparations tested in this study to 
treat established C. difficile infection, histologic evaluations were performed on cecal tissue 
specimens obtained from representative animals from the study described in section <b> above. 

Immediately following death, cecal tissue specimens were removed from animals 
which died in the i>re-24 and l're-4X groups, f ollowing the completion of the study, a 
representative surviving animal was sacrificed and cecal tissue specimens were removed from 
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the CTAB-24 and CTAB-48 groups. A single untreated animal from the same shipment as 
those used in the study was also sacrificed and a cecal tissue specimen was removed as a 
normal control. All tissue specimens were fixed overnight at 4°C in'l0% buffered formalin. 
The fixed tissues were paraffin-embedded, sectioned, and mounted on glass microscope slides. 
The tissue sections were then stained using hematoxylin and eosin (II and E stain), and were 
examined hy light microscopy. 

I 'pon examination, the tissues obtained from the CTAB-24 and CTAB-48 animals 
showed no pathology, and were indistinguishable from the normal control. This observation 
provides f urther ev idence for the ability of avian antitoxins raised against C. difficile toxins A 
and H to effectively treat established C. difficile infection, and to prevent the pathologic 
consequences which normally occur as a result of C. difficile disease. 

In contrast, characteristic substantial mucosal damage and destruction was observed in 
the tissues of the animals from the !>rc-24 and Pre-48 groups which died front < difficile 
disease. Normal tissue architecture was obliterated in these two preparations, as most of the 
mucosal layer was observed to have sloughed away, and there were numerous large 
hemorrhagic areas containing massive numbers of erythrocytes. 

EXAMPLE II 

Cloning And F.xpression Of C difficile Toxin A f ragments 

The toxin A gene has been cloned and sequenced, and shown to encode a protein of 
predicted MW of 308 kcl. [Dove el at.. Infect. Immun.. 58:480-488 ( |99(».| Given the 
expense and difficulty of isolating native toxin A protein, it would be advantageous to use 
simple and inexpensive procaryotic expression systems to produce and purify high levels of 
recombinant toxin A protein for immunization purposes. Ideally, the isolated recombinant 
protein would be soluble in order to preserve native antigenicity, since solubilized inclusion 
body proteins often do not fold into native conformations. To allow ease of purification, the 
recombinant protein should be expressed to levels greater than 1 mg/liter of E. coli culture. 
To determme whether high levels of recombinant toxin A protein can be produced in 
coli. fragments of the toxin A gene were cloned into various prokaryotic expression 
sectors, and assessed for the ability to express recombinant toxin A protein in E. coli. Three 
prokaryotic expression systems were utilized. These systems were chosen because they drive 
expression of either fusion (pMALc and P GEX2T) or native ( P LT23a-c> protein to high levels 
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in E. coli. and allow affinity purification of the expressed, protein on a ligand containing 
column. Fusion protein expressed from pGEX vectors bind glutathione agarose beads, and 
are eluted with reduced glutathione. pMAL fusion proteins bind amylose resin, and are eluted 
with maltose. A poly-histidine lag is present at either the N-tcrminal (pETI6b) or C-tcrminal 
<pET23a-c> end of pET fusion proteins. This sequence specifically binds Ni, chelate columns, 
and is eluted with imidazole salts. Extensive descriptions of these vectors arc available 
[Williams « «/. (1995) DMA Cloning 2: Expression Systems. Glover and llames. eds. IRL 
Press. Oxford, pp. I5-58J. and will not be discussed in detail here. The Example involved (a) 
cloning of the toxin A gene, (b) expression of large f ragments of toxin A in various 
prokaryotic expression systems, (c) identification of smaller toxin A gene fragments that 
express efficiently in E. coli. (d) purification of recombinant toxin A protein by affinity 
chromatography, and (e) demonstration of functional activity of a recombinant fragment of the 
toxin A gene. 

, r) s ») Cloning Of The Toxin A Gene 

A restriction map of the toxin A gene is shown in Figure 6. The encoded protein 
contains a carboxy terminal ligand binding region, containing multiple repeats of a 
carbohydrate binding domain. |von Eichel-Streibcr and Sauerbom. Ciene 96.107-1 13 (1990).] 
The toxin A gene was cloned in three pieces, by using either the polymerase chain reaction 
(PCR) to amplify specific regions, (regions I and 2. f igure 6) or by screening a constructed 
genomic library for a specific toxin A gene fragment (region 3. Figure <». I ho sequences of 
the utilized I'CR primers arc PI: 5" GGAAATT TAGCTGCAGCATCTGAC .V (SEQ ID 
NO.:H: P2: 5" TCTAGC A A ATTCGCTTGT GTTG A A .V (SEQ ID NO.:2): P3: 5" 
CTCXiC A T ATAGCATTAGACC .V (SEQ ID NO.:3): and P4: 5" 
C T ATCTAGGCCTAAAGTAT 3" (SEQ ID NO.:4). These regions were cloned into 
prokaryotic expression vectors that express either fusion (pMAI.c and pGEX2T) or native 
(pET23a-c> protein to high levels in E coli. and allow affinity purification of the expressed 
protein on a ligand containing column. 

Clostridium difficile VPI strain 10463 was obtained from the A I CC (ATCC W43255) 
and grown under anaerobic conditions in brain-heart infusion medium (RBL). High 
molecular-weight C. difficile DNA was isolated essentially as described by Wren and 
Tabaqchali (1987) j. Clin. Microbiol.. 25:2402. except proteinase K and sodium dodecyl 
sulfate (SDS) was used to disrupt the bacteria, and cetyltrimethylammonium bromide 
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precipitation [as described in Ausubel et aL. Current Protocols in Molecular Biology (1989)] 
was used to remove carbohydrates from the cleared lysate. The integrity and yield of genomic 
DNA was assessed by comparison with a serial dilution of uncut lambda DNA after 
electrophoresis on an agarose gel. 

Fragments I and 2 were cloned by PCR. utilizing a proofreading thermostable DNA 
polymerase (native/?/// polymerase. Stratagene). The high fidelity of this polymerase reduces 
the- mutation problems associated with amplification by error prone polymerases .(e.g.. Taq 
polymerase). PCR amplification was performed using the indicated PCR primers (Figure 6) 
in 50 ul reactions containing 10 mM Tris-HCI(8.3), 50 mM KG. 1.5 mM MgCI,. 200 uM 
each dNTP. 0.2 ^iM each primer, and 50 ng C difficile genomic DNA. Reactions were 
overlaid with 100 uj mineral oil. heated to 94°C for 4 min. 0.5 ul native pfu polymerase 
(Stratagene) added, and the reaction cycled 30x at 94°C for I min. 50°C for I min. 72°C for 
4 min. followed by 10 min at 72°C\ Duplicate reactions were pooled, chloroform extracted, 
and ethanol precipitated. After washing in 70% ethanol. the pellets were resuspended in 50 pi 
TE buffer [10 mM Tris-HCE. 1 mM EDTA pH 8.0]. Aliquots of lOul each were restriction 
digested with either EcoRVHincW (fragment I ) or EcoRUPstl (fragment 2). and the 
appropriate restriction fragments were gel purified using the Prep-A-Gehe kit (BioRad). and 
ligated to either EcolU/Swal -restricted pGEX2T (Pharmacia) vector (fragment I), or the 
Eco\\VPst\ pMAlc (New England Biolabs) vector (fragment 2). Both clones are predicted to 
produce in-frame fusions with either the glutathione-S-transferase protein (pGEX vector) or 
the maltose binding protein (pMAL vector). Recombinant clones were isolated, and 
confirmed by restriction digestion, using standard recombinant molecular biology techniques. 
(Sambrook et al. Molecular Cloning A Laboratory Manual (1989). and designated pGA30- 
660 and pMA660-l 100. respectively {see Figure 6 for description of the clone designations).) 

f ragment 3 was cloned from a genomic library of size selected Pst\ digested 
( difficile genomic DNA. using standard molecular biology techniques (Sambrook et aL). 
Given that the fragment 3 internal Psi\ site is protected from cleavage in C\ difficile genomic 
DNA (Price et aL. Curr. Microbiol.. 16:55-60 (1987)], a 4.7 kb fragment from Pst\ restricted 
C. difficile genomic DNA was gel purified, and ligated to Pay I restricted, phosphatase treated 
pUC9 DNA. The resulting genomic library was screened with a oligonucleotide primer 
specific to fragment 3. and multiple independent clones were isolated. The presence of 
fragment 3 in several of these clones was confirmed by restriction digestion, and a clone of 
the indicated orientation (Figure 6) was restricted with BamlU/HmdllL the released fragment 
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purified by ye! electrophoresis, and ligated into similarly restricted P ET23c expression vector 
DNA (Novagen). Recombinant clones were isolated, and confirmed by restriction digestion 
Ih.s construct is predicted to create both a predicted in frame fusion with the pET protein 
leader sequence, as well as a predicted C-terminal p<,ly-his,idine affinity tag. and is designated 
pPA I 100-2680 (see Figure 6 lor the clone designation). 

H) Expression Of Large Fragments Of Toxin A In £1 coli 
Protein expression from the three expression constructs made in (a) was induced and 
analyzed by Western blot analysis with an affinity purified, goat polyclonal antiserum directed 
aga.nst the toxin A toxoid (Tech Lab). The procedures utilized for protein induction. SDS- 
I'AGL. and Western blot analysis arc described in detail in Williams ct al ( 1 995). supra. In 
brief. 5 ml 2X YT (16 g tryptonc. 10 g yeast extract. 5 g NaCI per liter, pll 7,5 + 100 ug/ml 
ampicillin were added to cultures of bacteria (BL2 1 for pMAl and pGEX plasmids. and 
BL21(DE3)l.ysS for P ET plasmids) containing the appropriate recombinant clone which were 
induced to express recombinant protein by addition of IPTG to I mM. Cultures were grown 
at 37«C. and induced when the cell density reached 0.5 Of),,,. Induced protein was allowed 
to accumulate for two hrs after induction. Protein samples were prepared by pelleting I ml 
aliquots of bacteria by centrifugation ( I min in a microfuge). and resuspension of the pelleted 
bacteria in 150 ul of 2x SDS-PAGE sample buffer |Williams ,/ <,/.( |W 5 ). xupra \. The 
samples were heated to 95»C for 5 mi,, the cooled and 5 or 10 ul aliquots loaded on 7.5% 
SOS-PAGE gels. BioRad high molecular weight protein markers were also loaded, to allow 
estimation of the M W of identified fusion proteins. After electrophoresis, protein was 
detected either generally by staining gels with Coomassie blue, or specifically, by blotting to 
nitrocellulose for Western blol detection of specific immunoreactive protein. Western blots, 
(performed as described in Example 3) which detect toxin A reactive protein in cell lysates of 
induced protein from the three expression constructs are shown in f igure 7. In this figure, 
lanes 1-3 contain cell lysates prepared from /.. coli strains containing pl>Al 100-2860 in 
BI2l(l)E3)lysE cells: lanes 4-6 contain cell lysates prepared from E. coli strains containing 
pPA. 100-2860 in B12l(DL3)lysS cells: lanes. 7-9 contain cell lysates prepared from /, coli 
strains containing pMA30-660: lanes 10-12 contain cell lysates prepared from coli strains 
containing pMA660-l 100. The lanes were probed with an affinity purified goal antitoxin A 
polyclonal antibody (Tech Lab). Control lysates from uninduced cells (lanes I. 7. and 10) 
contain very little immunoreactive material compared to the induced samples in the remaining 
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lanes. The highest molecular weight band observed for each clone is consistent with the 
predicted size of the full length fusion protein. 

each construct directs expression of high molecular weight (H M W) protein that is 
reactive with the toxin A antibody. The size of the largest immunoreactive bands from each 
sample is consistent with predictions of the estimated MW of the intact fusion proteins. This 
demonstrates that the three fusions are in-frame, and that none of the clones contain cloning 
artifacts that disrupt the integrity of the encoded fusion protein. However, the Western blot 
demonstrates that fusion protein from the two larger constructs <pGA30-660 and pPAl 100- 
2680) are highly degraded. Also, expression levels of toxin A proteins from these two 
constructs are low. since induced protein bands are not visible by Coomassic staining (not 
shown). Several other expression constructs that fuse large sub-regions of the toxin A gene to 
either pMALc or pET23a-c expression vectors, were constructed and tested tor protein 
induction. These constructs were made by mixing gel purified restriction fragments, derived 
from the expression constructs shown in Figure 6. with appropriately cleaved expression 
vectors, ligating. and selecting recombinant clones in which the toxin A restriction fragments 
had ligated together and into the expression vector as predicted for in-frame fusions. The 
expressed toxin A interval within these constructs are shown in Figure 8. as well as the 
internal restriction sites utilized to make these constructs. 

As used herein, the term "interval" refers to any portion (/>.. any segment of the toxin 
which is less than the whole toxin molecule) of a clostridial toxin. In a preferred 
embodiment, "interval" refers to portions of C difficile toxins such as toxin A or toxin B, It 
is also contemplated that these intervals will correspond to epitopes of immunologic 
importance, such as antigens or immunogens against which a neutralizing antibody response is 
effected. It is not intended that the present invention be limited to the particular intervals or 
sequences described in these Examples. It is also contemplated that sub-portions of intervals 
ley., an epitope contained within one interval or which bridges multiple intervals) be used as 
compositions and in the methods of the present invention. 

In all eases. Western blot analysis of each of these constructs with goat antitoxin A 
antibody (Tech Lab) detected HMW fusion protein of the predicted size (not shown). This 
confirms that the reading frame of each of these clones is not prematurely terminated, and is 
fused in the correct frame with the fusion partner. However, the Western blot analysis 
revealed that in all cases, the induced protein is highly degraded, and. as assessed by the 
absence of identifiable induced protein bands by Coomassic Blue staining, are expressed only 
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at low levels. These results suggest that expression of high levels of intact toxin A 
recombinant protein is not possible when large regions of the toxin A gene are expressed in 
E. coli 'using these expression vectors. 

c) High Level Expression Of Small Toxin A Protein Fusions In 
E. coli 

Kxpcricnce indicates that expression difficulties are often encountered when large 
(greater than 100 kd) fragments are expressed in E. ,oti. A number of expression constructs 
conta.ning smaller fragments of the toxin A gene were constructed, to determine if small 
regions of the gene can be expressed to high levels without extensive protein degradation. A 
summary of these expression constructs are shown in Figure 9. All were constructed bv in- 
frame fusions of convenient toxin A restriction fragments to either the pMAI.c or pET23a-c 
vectors. Protein preparations from induced cultures of each of these constructs were analvzed 
by both Coomassie Blue staining and Western analysis as in (b) above. In all cases, higher 
levels of iritact ■ full length fusion proteins were observed than with the larger recombinants 
from section (b). 

d) Purification Of Recombinant Toxin A Protein 

Large scale (500 ml) cultures of each recombinant from (c) were grown, induced, and 
soluble and insoluble protein fractions were isolated. The soluble protein extracts were 
affinity chromatographed to isolate recombinant fusion protein, as described ( Williams a al. 
(1994). supra]. In brief, extracts containing tagged pl-T fusions were chromatographed on a 
nickel chelate column, and e|utcd using imidazole salts as described by the distributor 
(Novagen). Extracts containing soluble pMAL fusion protein were prepared and 
chromatographed in column buffer (10 mM NaPO,. 0.5M NaCI. 10 mM |i-mercaptocthanol. 
pH 7.2) over an amylose resin column (New England Biolabs). and clutcd with column 
buffer containing 10 mM maltose as described [Williams ci al. (1995). .m,,ra\. When the 
expressed protein was found to be predominantly insoluble, insoluble protein extracts were 
prepared by the method described in Example 17. infra. The results are summarized in fable 
16. Figure 10 shows the sample purifications of recombinant toxin A protein. In this figure, 
lanes I and 2 contain MBP fusion protein purified by affinity purification ol soluble protein. 
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TABLE 16 

Purification Of Recombinant Toxin A Protein 



Clone ^ 


Protein 
Solubility 


Yield Affinity 
Purified Soluble 

rrOletn 


% Intact Soluble 
Fusion Protein tc * 


■ icio iniuct 
Insoluble Fusion 
Protein 


pMA30-270 


C 0 l tl UL> 


4 mg/500 mis 


10% 


NA 


PMA30-300 




4 mg/500 mis 


5-10% 


NA 


pM A 300-660 


liicrklnHI** 
lilol'lUUIC 




NA 


10 mg/500 ml 


pMA660-IIOO 


Soluble 


4 S mp/500 mlc 


Mr>o 


NA 


pMAI 100-1610 


Soluble 


18 mg/500 mis 


10% 


NA 


pMAI6IO-!870 


Both 


22 mg/500 mis 


90% 


20 mg/500 ml 


pMA 1450- 1870 


Insoluble 




NA 


0.2 mg'500 ml 


pPAl 100- 1450 


Soluble 


0.1 mg/500 mis 


90% 


NA 


pPAl 100-1870 


Soluble 


' 0.02 mg/500 mis 


90% 


NA 


pMA 1870-26X0 


Both 


12 mg/500 mis 


80% 


NA 


pPa 1870-2680 


Insoluble 




NA 


10 mg/500 ml 



,u. pl> " P ET23 vec, or. pM=pMAU vector. A = toxin A. 

1, Based on 1.5 OD : ,„ - I msi/nil (extinction coefficient o( M»P). 

I isiimated by Coomassie staining of SDS-PAGE iicls. 

Lanes 3 and 4 contain MBP fusion protein purified by solubilization of insoluble inclusion 
bodies. The purified fusion protein samples are pMA1870-2680 (lane I). pMA6(>0 i 100 (lane 
2). pMA300-600 (lane 3) and pMAI450-I870 (lane 4). 

Poor yields of affinity purified protein were obtained when poly-histidine tagged P ET 
vectors were used to drive expression (pPAl 100-1450. pPI 100-1870). However, significant 
protein yields were obtained from pMAI. expression constructs spanning the entire toxin A 
gene, and yields of full-length soluble fusion protein ranged from an estimated 200-400 
Mg/500 ml culture (pMA30-30O) to greater than 20 mg/500 ml culture (pMA 161 0-1 870). 
Only one interval was expressed to high levels as strictly insoluble protein ( P MA30()-660). 
Thus, although high level expression was not observed when using large expression constructs 
from the toxin A gene, usable levels of recombinant protein spanning the entire toxin A gene- 
were obtainable by isolating induced protein from a series of smaller pMAI. expression 
constructs that span the entire toxin A gene. This is the first demonstration of the feasibility 
of expressing recombinant toxin A protein to high levels in E. co/i. 
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c) Hemagglutination Assay Using The Toxin A Recombinant 
Proteins 

The carboxy terminal end consisting of the repeating units contains the 
hemagglutination activity or binding domain of C difficile toxin A. To determine whether 
the expressed toxin A recombinants retain Junctional activity, hemagglutination assays were 
performed, Two toxin A recombinant proteins, one containing the binding domain as cither 
soluble affinity purified protein (pMA 1 870-2680) or SDS solubilized inclusion body protein 
(pPA 1 8 70-2680) and soluble protein from one region outside that domain (pMAI 100-1610) 
were tested using a described procedure. (H.C Krivan ei at.. Infect Immun.. 53:573 
( 1 986). | Citrated rabbit red blood cells (RRBC)(Cocalieo> were washed several times with 
Tris-bulVer ( O.IM Tris and 50 mM NaCI ) by centrifugation at 450 x g for 10 minutes at 4° 
C. A 1%'RRBC suspension was made from the packed cells and resuspended in I ris-buffer. 
Dilutions of the recombinant proteins and native toxin A (Tech Labs) were made in the I ris- 
buffer and .added in duplicate to a round-bottomed 06-well microliter plate in a final volume 
of 100 p|. To each well. 50 ul of the 1% RRBC suspension was added, mixed by gentle 
tapping, and incubated at 4°C for 3-4 hours. Significant hemagglutination occurred' only in 
the recombinant proteins containing the binding domain (pMA 1870-2680) and native toxin 
A. The recombinant protein outside the binding domain (pMA 1 100-1610) displayed no 
hemagglutination activity. Using equivalent protein concentrations, the hemagglutination titer 
for toxin A was 1:256. while titers for the soluble and insoluble recombinant proteins of un- 
binding domain were 1 :256 and about 1 :5000. Clearly: .he recombinant proteins tested 
retained functional activity and were able to bind RRBC's. 

EXAMPLE 12 

functional Activity Of IgY Reactive Against Toxin A Recombinants 

The expression of recombinant toxin A protein as multiple fragments in L.co/i has 
demonstrated the feasibility of generating toxin A antigen through use of recombinant 
methodolog.es (Example 1 1 ). The isolation of these recombinant proteins allows the 
immunoreactivity ol each individual subrcgion of the toxin A protein to be determined .(i.e.. 
in a antibody pool directed against the native toxin A protein). This identifies the regions (if 
any) for which little or no antibody response is elicited when the whole protein is used as a 
immunogen. Antibodies directed against specific fragments of the toxin A protein can be 
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purified by affinity chromatography against recombinant toxin A protein, and tested for 
neutralization ability. This identifies any toxin A subregions that arc essential for producing 
neutralizing antibodies. Comparison with the levels of immune response directed against " 
these intervals when native toxin is used as an immunogen predicts whether potentially higher 
tilers of neutralizing antibodies can be produced by using recombinant protein directed against 
a individual region, rather than the enure protein. Finally, since it is unknown whether 
antibodies reactive to the recombinant toxin A proteins produced in Example I I neutralize 
toxin A as effectively as antibodies ra.sed against native toxin A (Examples 0 and 10). thc 
protective ability of a pool of antibodies affinity purified against recombinant toxin A 
fragments was assessed for its ability to neutralize toxin A. 

This Example involved (a) epitope mapping of the toxin A protein to determine the 
titre of specific antibodies directed against individual subregions of the toxin A protein when 
native toxin A protein is used as an immunogen. (b) affinity purification of IgY reactive 
against recombinant proteins spanning the toxin A gene. (c> toxin A neutralization assavs with 
affinity purified IgY reactive to recombinant loxin A protein to identify subregions of the 
loxin A protein that induce the production of neutralizing antibodies, and determination of 
whether complete neutralization of toxin A can be elicited with a mixture of antibodies 
reactive to recombinant toxin A protein. 

a) Epitope Mapping Of Thc Toxin A Gene 

The al linity purification of recombinant toxin A protein specific to defined intervals of 
Uic toxin A protein allows epitope mapping of antibody pools directed against native toxin A. 
This has not previously been possible, since previous expression of toxin A recombinants has 
been assessed only by Western blot analysis, without knowledge of the expression levels of 
the protein [c.c. von Eichel-Slreiber et a/. J. Gen. Microbiol.. 1 35:55-64 ( I989)|. Thus, high 
or low reactivity of recombinant toxin A protein on Western blots may reflect protein 
expression level differences, not immunoreactivity differences. Given that the purified 
recombinant protein generated in Example 1 I have been quantitated. the issue of relative 
■mmunoreactivity of individual regions of the toxin A protein was precisely addressed. 

I or the purposes of this Example, the toxin A protein was subdivided into 6 imcrvals 
( I -6). numbered from the amino ( interval 1 ) to the carboxyl (interval 6) termini. 

The recombinant proteins corresponding to these intervals were from expression clones 
(sec Example 1 1(d) for clone designations) pMA30-300 (interval I ). P MA300-660 (interval 
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2). pMA660-l 100 (interval 3). pPAl 100-1450 (interval 4). pMA1450-1870 (interval 5) and 
pMA 1 870-2680 (interval 6). These 6 clones were selected because they span the entire 
protein from amino acids numbered 30 through 2680. and subdivide the protein into 6 small 
intervals. Also, the carbohydrate binding repeat interval is contained -specifically in one 
interval (interval 6). allowing evaluation of the immune response specifically directed against 
this region. Western blots of 7.5% SDS-PAGE gels, loaded and electrophoresed with defined 
quantities of each recombinant protein, were probed with cither goat antitoxin A polyclonal 
antibody ( lech Lab) or chicken antitoxin A polyclonal antibody |pCTA IgY. Example 8(c)). 
The blots were prepared and developed with alkaline phosphatase as previously described 
I Williams el a/. (1095). supra]. At least 90% of all reactivity, in cither goal or chicken 
antibody pools, was found to be directed against the ligand binding domain (interval 6). The 
remaining immunorcactivity was directed against all five remaining intervals, and was similar 
in both antibody pools, except that the chicken antibody showed a much lower reactivity 
against interval 2 than the goal antibody. 

This clearly demonstrates that when native toxin A is used as an immunogen in goats 
or chickens, the bulk of the immune response is directed against the ligand binding domain of 
the protein, with ihe remaining response distributed throughout the remaining 2/3 of the 
protein. 

I>) Affinity Purification Of IgY Reactive Against Recombinant 
Toxin A Protein 

Al Unity columns, containing recombinant toxin A protein from the 6 defined intervals 
in (a) above, were made and used to (i) affinity purify antibodies reactive to each individual 
interval from the C I A IgY preparation | Example 8(c) |. and (ii) deplete interval specific 
antibodies from the CTA IgY preparation. Affinity columns were made by coupling 1 ml of 
PBS-vvashed Actigcl resin (Sterogenc) with region specific protein and 1/10 final volume of 
Aid-coupling solution <IM sodium cyanoborohydride). The total region specific protein 
added to each reaction mixture was 2.7 mg (interval 1). 3 mg (intervals 2 and 3). 0.1 mg 
(interval 4). 0.2 mg (interval 5) and 4 mg (interval 6). Protein for intervals 1. 3. and ft was 
affinity purified pMAl fusion protein in column buffer (see Example II). Interval 4 was 
affinity purified poly-histidine containing pET fusion in PBS: intervals 2 and 5 were from 
inclusion body preparations of insoluble pMAI. fusion protein, dialyzed extensively in PUS. 
Aliuuois of the supernaiants from the coupling reactions, before and after coupling, were 
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assessed by Coomassie staining of 7.5% SDS-PAGE gels. Based on protein band intensities, 
in all cases greater than 50% coupling efficiencies were estimated. The resins were poured 
into 5 ml BioRad columns, washed extensively with PBS. and stored at 4°C. 

Aliquots of the CTA IgY polyclonal antibody preparation were depleted for each 
individual region as described below. A 20 ml sample of the CTA IgY preparation | Example 
X(c)| was dialyzed extensively against 3 changes of PBS ( I liter lor each dialysis), quantified 
hy.absortwncc.at OlX xll . and stored at 4°G. Six I ml aliquots of the dialyzed IgY preparation 
were removed, and depleted individually for each of the six intervals. Kach 1 ml aliquot was 
passed over the appropriate affinity column, and the eluate twice reapplied to the column. 
The eluate was collected, and pooled with a I ml PBS wash. Bound antibody was eluted 
from the column by washing with 5 column volumes of 4 M Ciuanidine-HCi fin 10 mM Tris- 
I ICI. pi I 8.0). The column was reequi Iterated in PBS. and the depleted antibody stock 
reapplied as described above. The eluate was collected, pooled with a 1 ml PBS wash, 
quantilated by absorbance at OD,„. and stored at 4° C. In this manner. 6 aliquots of the CTA 
IgY preparation were individually depleted for each of the 6 toxin A intervals, by two rounds 
ol affinity depletion. The specificity of each depleted stock was tested by Western blot 
analysis. Multiple 7.5% SDS-PAGli gels were loaded with protein samples corresponding to 
all ft toxin A subregions. After electrophoresis, the gels were blotted, and protein transfer 
confirmed by Ponceau S staining [protocols described in Williams el al (IWS). supra\. After 
blocking the blots 1 hr at 20°C in PBS+ 0.1% Tween 20 (PBST) containing 5% milk (as a 
blocking buffer). 4 ml of either u 1/500 dilution of the dialyzed CTA IgY preparation in 
blocking buffer, or an equivalent amount of the six depleted antibody stocks (using Of).*, to 
standardize antibody concentration) were added and the blots incubated a further I hr at room 
temperature. The blots were washed and developed with alkaline phosphatase (using a rabbit 
anti-chicken alkaline phosphate conjugate as a secondary antibody) as previously described 
| Williams a al. (1995). xnpra\. In all cases, only the target interval was depleted for antibody 
reactivity, and at least 90% of the reactivity to the target intervals was specifically depleted. 

Region specific antibody pools were isolated by affinity chromatography as described 
below, Ten mis of the dialyzed (TA IgY preparation were applied sequentially to each 
affinity column, such that a single 10 ml aliquot was used to isolate region specific antibodies 
specific to each of the six subregions. The columns were sequentially washed with 10 
volumes of PBS. 6 volumes of BBS-Tween. 10 volumes of TBS. and eluted with 4 ml 
Actiscp elution media (Sterogerie). The eluate was dialyzed extensively against several 
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changes of PBS. and the affinity purified antibody collected and stored at 4<>C. The volun.es 
of the eluate increased to greater than 10 mis during dialysis in each case, due to the high 
viscosity of the Actisep elution media. Aliquots of each sample were 30x concentrated using 
Ccntncon 30 nnenrconcentrators (Amicon) and stored a, 4T. The specificity of each region 
spectllc antibody pool was tested, relative to the dialyzed CTA IgY preparation, by Western 
blot analysts, exactly as described above, except that 4 ml samples of blockinu buffer 
containing .00 M | region specific antibody (unconcentrated) were used instead of the depleted 
C I A IgY preparation, Each affinity purified antibody preparation was specific to the defined 
..nerval, except that samples purified against intervals 1-5 also reacted with interval 6 This 
may be due to non-specific binding to the interval 6 protein, since this protein contains the 
repct.me hgand binding domain which has been shown to bind antibodies nonspecifica.lv. 
|Lycrl\ cial.. Curr. Microbiol.. 19:303-306 (1989).| 

The reactivity of each affinity purified antibody preparation ,o the corresponding; 
proteins was approx innately the same as the reactivity of the 1/500 diluted dialv/ed C TA IgY 
preparation standard. Given that the specific antibody stocks were diluted ./40. this would 
indicate lluu the unconcentrated affinity purified antibody stocks contain I /1 0-1/20 the 
concentration of specific antibodies relative to the starling -CTA IgY preparation. 

e) Toxin A Neutralization Assay Using Antibodies Reactive 
Toward Recombinant Toxin A Protein 

The CI IO toxin neutralization assay (Example 8(d)| was used to assess .he abilitv of 
.he depleted or enriched samples generated in (b> above to neutrals the cytotoxicitv of toxin 
A. The general ability of affinity purified antibodies to neutralize toxin A was assessed bv 
•mixing together aliquots of all 6 concentrated stocks of the 6 affinity purified samples 
generated in (b> above, and testing the ability of this mixture to neutralize a toxin A 
concentration of 0.1 ug/ml. The results, shown in Figure II. demonstrate almost complete 
neutralization of loxin A using the affinity purified ( Al>) mix. Some epitopes within the 
recombinant proteins utilized for affinity purification were probably lost when the proteins 
were denatured before affinity purification (by Ciuanidinc-HCI treatment in (b) above). Thus, 
the neutralization ability of antibodies directed against recombinant protein is probablv 
underestimated using these affinity purified antibody pools. This experiment demonstrates 
that antibodies reactive to recombinant loxin A can neutralize cytotoxicity, suggesting that 
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neutralizing antibodies may be generated by using recombinant toxin A protein as 
immunogen. 

In view of the observation that the recombinant expression clones of the toxin A gene 
divide the protein into 6 subrcgions. the neutralizing ability of antibodies directed against each 
individual region was assessed. The neutralizing ability of antibodies directed against the 
ligand binding domain of toxin A was determined first. 

In the toxin neutralization experiment shown in Figure 1 I. interval 6 specific 
antibodies (interval 6 contains the ligand binding domain » Were depleted from the dialyzed 
PF.G preparation, and the effect on toxin neutralization assayed. Interval 6 antibodies were 
depleted either by utilizing the interval ft depleted CTA IgY preparation from (b) above C -6 
alf. depleted" in Figure I I). or by addition of interval 6 protein to the CTA IgY preparation 
(estimated to be a 10 fold molar excess over anti-interval 6 immunoglobulin present in this 
preparation) to competitively compete for interval 6 protein ("-6 prot depleted" in Figure II ). 
In both instances, removal of interval 6 specific antibodies reduces the neutralization 
efficiency relative to the starling CTA IgY preparation. This demonstrates that antibodies 
directed against interval 6 contribute to toxin neutralization. Since interval 6 corresponds to 
the ligand binding domain of the protein, these results demonstrate that antibodies directed 
againsi this region in the PF.G preparation contribute to the neutralization of toxin A in this 
assay. However, it is significant that after removal of these antibodies, the PFG preparation 
retains significant ability lo neutralize toxin A (Figure I I). This neutralization is probably 
due to the action of antibodies specific to other regions of the toxin A protein, since at least 
')()% of ihe ligand binding region reactive antibodies were remov ed in the depleted sample 
prepared in (b) above. This conclusion was supported by comparison of the toxin 
neutralization of ihe affinity purified (AP) mix compared to affinity purified interval 0 
2:> antibody alone. Although some neutralization ability was observed with AP interval 6 

antibodies alone, the neutralization was significantly less than that observed with the mixture 
ol all 6 AP antibody stocks (not shown). 

Given that the mix of all six affinity purified samples almost completely neutralized 
the cytotoxicity of toxin A (Figure II). ihe relative importance of antibodies directed against 
30 toxin A intervals 1-5 within the mixture was determined. This was assessed in tun ways. 

First, samples containing affinity purified antibodies representing 5 of the C> intervals were 
prepared, such that each individual region was depleted from one sample. Figure 12 
demonstrates a sample neutralization curve, comparing the neutral ization ability of affinity 



20 



- 85 - 



WO 98/08540 PCT/US97/15394 

purified antibody mixes without interval 4 (-4) or 5 (-5) specific antibodies, relative to the 
mix. of all 6 affinity purified antibody stocks (positive control). While the removal of interval 
5 specific antibodies had no effect on toxin neutralization (or intervals 1-3. not shown), the 
loss ol interval- 4 specific antibodies significantly reduced toxin neutralization < Figure 12). 

Similar results were seen in a second experiment, in which affinity purified antibodies 
directed against a single region, were added to interval 6 specific antibodies, and the effects 
on toxin neutralization assessed. Only interval 4 specific antibodies significant enhanced 
neutralization when added to interval 6 specific antibodies ( Figure 1 3). These results 
demonstrate that antibodies directed against interval 4 (corresponding to clone pPAIIGO-1450 
in Figure <» are important for neutralization of cytotoxicity in this assay. Epitope mappine 
has shown that only low levels ol antibodies reactive to this region are generated when native 
u«.n A « used as an immunogen [Example I2(a)|. It is hypothesized that immunization with 
recombinant protein specific to this interval will elicit higher tilers of neutralizing antibodies. 
In summary, this analysis has identified two critical regions of the toxin A protein auainst 
which neutralizing antibodies are produced, as assayed by the CHO neutralization assay. 

EXAMPLE 13 

Production And Evaluation Of Avian Antitoxin 
Against C. difficile Recombinant Toxin A Polypeptide 

In Example 12. we demonstrated neutralization of toxin A mediated cytotoxicity bv 
affinity purified antibodies reactive to recombinant toxin A protein. To determine whether 
antibodies raised against a recombinant polypeptide fragment of r. difficile toxin A may be 
effective in treating clostridial diseases, antibodies to recombinant toxin A protein representing 
•he binding domain were generated. Two toxin A binding domain recombinant polypeptides' 
expressing the binding domain in either the pMALc (pMA 1870-2680) or pET 23(pl'AI870- 
2680) vector, were used as immunogens. The pMAI. protein was affinity purified as a soluble 
product | Example 12(d)] and the pET protein was isolated as insoluble inclusion bodies 
| Example I2(d)| and solubilized to an immunologically active protein using a proprietary 
method described in a pending patent application (U.S. Patent Application Serial No. 
08/12^.027). This Example involves (a) immunization, (b) antitoxin collection, (c) 
determination of antitoxin antibody tiler, (d) ami -recombinant toxin A neutralization of toxin 
A hemagglutination activity ,,, vim,, and (e) assay of in >>i,r„ toxin A neutralizing activity. 
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a) Immunization 

The soluble and the inclusion body preparations each were used separately to 
immunize hens. Both purified toxin A polypeptides were diluted in >PBS and emulsified with 
approximately equal Volumes of CFA lor the initial immunization or I FA for subsequent 
booster immunizations. On day zero, lor each of the recombinant preparations, two egg 
laying white Leghorn hens (obtained from local breeder) were each injected at multiple sites 
(intramuscular and subcutaneous) with I ml of recombinant adjuvant mixture containing 
approximately 0.5 to 1 .5 mgs of recombinant toxin A. Booster immunizations of 1.0 mg 
were given on days 14 and day 28. 

o) Antitoxin Collection 

lotai yolk immune IgY was extracted as described in the standard PFG protocol (as in 
l-xample 1 > and the final IgY pellet was dissolved in sterile I'BS at the original yolk volume. 
I his material is designated "immune recombinant IgY" or "immune liiY." 

c) Antitoxin Antibody Titer 

To determine if the recombinant toxin A protein was sufficiently immunogenic to raise 
antibodies' in hens, the antibody titer of a recombinant toxin A polypeptide was determined by 
I- USA. liggs from both hens were collected on day 32. the yolks pooled and the antibody 
was isolated using PEC) as described. The immune recombinant IgY antibody titer was 
determined for the soluble recombinant protein containing the maltose hinding protein fusion 
generated in p-Mal (pMA 1 870-2680). Ninety-six well Falcon l>ro-bind plates were coated 
overnight at 4°C with 100 ul /well of toxin A recombinant at 2.5 w / M l in |>BS containing 
0 05% ihimcrosal. Another plate was also coated with maltose binding protein (MBP) at the 
same concentration, to permit comparison of antibody reactivity to the fusion partner. The 
next day. the wells were blocked with PBS containing 1% bovine serum albumin (USA) for I 
hour at 37°C\ IgY isolated from immune or preimmune eggs was diluted in antibody diluent 
(PBS containing 1% BSA and 0.05% Tween-20). and added to the blocked wells and 
incubated lor I hour at 37°C The plates were washed three times with PBS with 0.05% 
Tween-20. then three times with PBS. Alkaline phosphatase conjugated rabbit anti-chicken 
IgG (Sigma) diluted 1:1000 in antibody diluent was added to the plate, and incubated for 1 
hour at 37°C\ The plates were washed as before and substrate was added, fp-nitroplicnyl 
Phosphate (Sigma) | at I mg/m| j„ ().()5M Na : CO,. pH 9.5 and 10 mM MgCK. The plates 
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were evaluated quantitatively on a Dynatech MR 300 Micro HP A plate reader at 410 nm 
about 10 minutes after the addition of substrate. 

Based on these ELISA results, high antibody titers were raised in chickens immunized 
with the toxin A recombinant polypeptide. The recombinant appeared to be highly 
immunogenic, as it was able to generate high antibody titers relatively quickly with few 
immunizations. Immune IgY titer directed specifically to the toxin A portion of the 
recombinant was higher than the immune IgY titer to its fusion partner, the maltose binding 
protein, and significantly higher than the prcimmune IgY. I.-L1SA titers (reciprocal of the 
highest dilution of IgY generating a signal) in (he prcimmune IgY to the MBP or the 
recombinant was <I:3() while the immune IgY titers to MBP and the toxin A recombinant 
were 1:1X750 and > 1:93750 respectively. Importantly, the ami-recombinant antibody titers 
generated in the hens against the recombinant polypeptide is much higher, compared to 
antibodies to that region raised using native toxin A. The recombinant antibody liter to 
region 1870-2680 in the C I A antibody preparation is at least live-fold lower compared to the 
recombinant generated antibodies (II 8750 versus -- 1 :93750). Thus, it appears a better 
immune response can be generated against a specific recombinant using that recombinant as 
the immunogen compared to the native toxin A. 

This observation is significant, as it shows that because recombinant portions stimulate 
the production of antibodies, it is not necessary to use native toxin molecules to produce 
antitoxin preparations. Thus, the problems associated with the toxicity of the native toxin arc 
avoided and large-scale antitoxin production is facilitated. 

d) Anti-Rccombinant Toxin A Neutralization Of Toxin A 
Hemagglutination Activity in Vitro 

Toxin A has hemagglutinating activity besides cytotoxic and enterotoxin properties. 
Specifically, toxin A agglutinates rabbit erythrocytes by binding to a trisaccharide (gal 1-3BI- 
4GlcNAc) on the cell surface. fH. Krivan et at.. Infect. Immuii.. 53:573-581 (I986).| We 
examined whether the anti-recombinant toxin A (immune IgY. antibodies raised against the 
insoluble product expressed in pET) can neutralize the hemagglutination activity of toxin A in 
vitro. The hemagglutination assay procedure used was described by ll.C. Krivan el al 
Polyethylene glycol-fractionatcd immune or preimmime IgY were pi c-absorbed with citratcd 
rabbit erythrocytes prior to performing the hemagglutination assay because we have found that 
IgY alone can agglutinate red blood cells. Citratcd rabbit red blood cells (RKBC'sKCocaiico) 
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were washed 'twice -by' centrifugation at 450 x g with isotonic buffer (0.1 M Tris-HCl. 0.05 M 
NaCI. pH 7.2). RRBC-rcactive antibodies in the IgY were removed by preparing a 10% 
RRBC suspension (made by adding packed cells to immune or preimmune IgY) and 
incubating Hie. mixture for I hour at 37°C. The RRBCs were then removed by centrifugation. 
Neutralization of the hemagglutination activity of toxin A by antibody was tested in round- 
bottomed M6-well microliter plates. Twenty-five ^il of toxin A (36 j.tg /ml) (Tech Lab) in 
isotonic buffer was mixed with an equal volume of different dilutions of immune or 
preimmune IgY in isotonic buffer, and incubated for 15 minutes at room temperature. Then. 
50 m) of a 1% RRBC suspension -in isotonic buffer was added and the mixture was incubated 
lor 3 hours at 4°(\ Positive control wells containing the final concentration of 9 ng/ml of 
toxin A after dilution without IgY were also included. Hemagglutination activity was 
assessed visually, with a diffuse matrix of RRBCs coating the bottom of the well 
representing a positive hemagglutination reaction and a tight button of RRBC s at the bottom 
of the well representing a negative reaction. The anti-reeombinam immune IgY neutralized 
toxin A hemagglutination activity, giving a neutralization titer of 1:8. However, preimmune 
IgY was unabie to neutralize the hemagglutination ability of toxin A. 

e) Assay Of in Vitro Toxin A Neutralizing Activity 

The ability of the anti-recombinant toxin A IgY (immune IgY antibodies raised against 
pMA 1X70-2680. the soluble recombinant binding domain protein expressed in pMAL. 
designated as Ami-tpx. A-2 in f igure 14 . and referred to as recombinant region 6) and pre- 
immune IgY. prepared as described in Hxample 8(c) above, to neutralize the cytotoxic activity 
of toxin A was assessed /// vitro using the CHO cell cytotoxicity assay, and toxin A { I ech 
Lab) at a concentration of 0. l ug/ml. as described in Lxamplc X(d) above. As additional 
controls, the anti-native toxin A IgY (CTA) and pre-immune IgY preparations described in 
lixample 8(c) above were also tested The results are shown in Figure 14. 

The anti-recombinant toxin A IgY demonstrated only partial neutralization of the 
cytotoxic activity of toxin A. while the pre-immune IgY did not demonstrate any significant 
neutralizing activity. 
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EXAMPLE 14 

In vivo Neutralization Of C. difficile Xoxm A 

- ' . i 
The ability of avian antibodies ( IgY) raised against recombinant toxin A binding 
domain ,o neutralize the enterotoxin activity ol C. difficile toxin A was evaluated in Wvo 
using Golden Syrian hamsters. The Example involved: (a, preparation of the av.an anti- 
recombinant toxin A IgY for oral administration: (b) in >„■„ protection of hamsters from C 
ddfudc toxin A entcrotoxicity by treatment of toxin A with avian anti-recombinant toxin A 
lgY: and (c) histologic evaluation of hamster ceca. 

a) Preparation Of The Avian Anti-Recombinant Toxin A IgY 
For Oral Administration 

liggs were collected front hens which" had been immunized with the recombinant < ' 
difficile toxin A fragment P MA1870-2680 (described in Example 13. above,. A second group 
ol eggs purchased at a local supermarket was used as a pre-immune (negative, control l/ L .« 
yolk immunoglobulin (IgY, was extracted by PEG from the two groups of eg^s as described 
in Example 8(c,. and the linal IgY pellets were solubilized in one-fourth the oriumal volk 
volume using O.IM carbonate butler (mixture of NaHCO, and Na : t(. M : P ll .« 5 The basic 
carbonate buffer was used in order to protect the toxin A from the acidic pll of the stomach 
environment. 

I») in vivo Protection Of Hamsters Against C difficile Toxin A 
Entcrotoxicity By Treatment Of Toxin A With Avian Anti- 
rccombinant Toxin A IgY 

In order to assess the ability of the avian anti-recombinant toxin A IgY. prepared in 
section <a» above to neutralize the in vivo endotoxin activity of toxin A. an in vivo toxin 
neutralization model was developed using Golden Syrian hamsters. This model was based on 
published values for the minimum amount of toxin A required to elicit diarrhea (0.08 mg 
toxin A/Kg body wt. , and death (0. 1 ft mg toxin A/Kg body w.. , i„ hamsters when 
administered orally (l.yerly ct al. Infect. Immun.. 47:349-352 (l<)85). 

l or the study, four separate experimental groups were used, with each group consisting 
of 7 female Golden Syrian hamsters (Charles River), anprox. three and one-half weeks old. 
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weighing approx. 50 gms each. The animals were housed as groups of 3 and 4. and were 
offered food and water ad libUum through the entire length of the study. 

For each animal, a mixture containing either I Oug of toxin A |f0.2 mg/Kg) or 30ug of 
mxin A (0.6 mg/Kg) « difficile toxin A was obtained from Tech Lab and I ml of either the 
anti-recombinan. toxin A IgY or pre-immune IgY ffrom section (a) above) was prepared. 
These mixtures were incubated a, 37°C for 60 min. and were then administered to the animals 
by the oral route. The animals were then observed lor the onset of diarrhea and death for a 
period of 24 hr S . following the administration of the toxin A HgY mixtures, at the end of 
which time, the following results were tabulated and shown in Table 17: 

TABLE 17 

Study Outcome At 24 Hours 



f vpcrimcnul Group 



10 .lie. Toxin A - Antitoxin Against Interval 6 



»<) ny Toxin A - Antitoxin Auainst Interval 6 



10 tia Toxin A • I're-linmunc Serum 



.<» tig -Toxin A IVe-lmmune 



Study Outcome at 24 Hours 



Healthv' 



Diarrhea 2 



t) 



Dead 1 



(I 



Aniiltals remained healthy through the entire 24 hour studv period. 
Animals developed diarrhea, but did not die. 
Animals developed diarrhea, and subsequently died. 

f>retreatmcnt of toxin A at both doses tested, using the unii-recomhinant toxin A IgY. 
prevented all overt symptoms of disease in hamsters. Therefore, pretrcalment of C. difficile 
toxin A. using the anti-recombinant toxin A IgY. neutralized the ,„ viva emcrotoxin activity 
"I the toxin A. In contrast, all animals from the two groups which received toxin A which 
had been nrctreated usmy pre-immune IgY developed disease symptoms which ranged from 
diarrhea to death. The diarrhea which developed in the 5 animals which did not die in each 
of the two pre-immune groups, spontaneously resolved by the end of the 24 hr. study period. 

c) Histologic Evaluation Of Hamster Ccca 

In order to further assess the ability of anli-recombinant toxin A IgY to protect 
hamsters from the emcrotoxin activity of toxin A. histologic evaluations were performed on 
Ihe ceca of hamsters from the study described in section (b) above. 

Three groups of animals were sacrificed in order to prepare histological specimens. 
The lirst group consisted of a single representative animal taken from each of the 4 groups of 
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surviving hamsters at the conclusion of the study described in section (b) above. These 
animals represented the 24 hr. timepoint of the study. 

The second group consisted of two animals which were not part of the studv described 
above, but were separately treated with the same toxin A . pre-immunC IgY mixtures as 
described for the animals in section (b) above. Both of these hamsters developed diarrhea, 
and were sacrificed 8 hrs. after the time of administration of ,he toxin A + prc-immune IgY 
mixtures. At the time o I sacrifice, both animals were presenting symptoms of diarrhea. 
These animals represented the acute phase of the study. 

The final group consisted of a single untreated hamster from the same shipment of 
animals as those used for the two previous groups. This animal served as the normal control. 

Samples of cecal tissue were removed from the 7 animals described above, and were 
.fixed overnight at 4°C using 10% buffered formalin. The fixed tissues were paraffin- 
embedded, sectioned, and mounted on glass microscope slides. The tissue sections were then 
stained using hematoxylin and eosin ,11 and E stain,, and * ere examined by light microscopy. 

The tissues obtained from the two 24 hr. animals which received mixtures containing 
either IO M g or 30ug of toxin A and anti-recombinant toxin A IgY were indistinguishable from 
the normal control, both in terms of gross pathology, as well as at the microscopic level. 
I hese observations provide further evidence for the ability ol anti-recombinant toxin A IgY to 
effectively neutralize the ,„ vivo emerotoxin activity off. c/./ficfc toxin A. and thus its ability 
20 to prevent acute or lasting toxin A-induced pathology. 

In contrast, the tissues from the two 24 hr. animals which received the toxin A + pre- 
immuhe IgY mixtures demonstrated significant pathology. In both of these groups, the 
mucosal layer was observed to be less organized than in the normal control tissue. The 
cytoplasm of the epithelial cells had a vacuolated appearance, and gaps were present between 
25 the epithelium and the underlying cell layers. The lamina propria was largely absent. 

Intestinal villi and crypts were signillcantly diminished, and appeared to have been overgrown 
by a planar layer of epithelial cells and fibroblasts. Therefore, although these animals overtly 
appeared to recover from the acute symptoms of toxin A intoxication, lasting pathologic 
alterations to the cecal mucosa had occurred. 

*° Thc tissucs Plained I™ the two acute animals which received .Mixtures of toxin A 

and pre-immune IgY demonstrated the most significant pathology'. At the gross pathological 
level, both animals were observed to have severely distended ceca which were filled with 
watery, diarrhea-like material. At the microscopic level, the animal that was given the 
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mixture containing «0ug of toxin A and pre-immune IgY was found to have a mucosal lavcr 
which had a ragged, damaged appearance, and a disorganized, compacted quality. The crvpts 
were largely absent, and numerous breaks in the epithelium had occurred. There was also an 
■minx of erythrocytes ,nu> spaces between the epithelial layer and the underlying tissue The 
animal which had received the mixture containing 30pg of toxin A and pre-immune IgY 
demonstrated the most severe pathology. The cecal tissue of this animal had an appearance 
very s.m.lar to thai observed in animals which had died from C. difficile disease. Widespread 
dcs.ruci.on of the mucosa was noted, and the epithelial layer had sloughed. Hemorrhage 
areas contaming large numbers of erythrocytes were very prevalent. All semblance of normal 
nssuc arch.tecture was absent from this specimen. In terms of the presentation of pathologic 
events, tins ,„ ,•„•„ hamster model of toxin A-intoxication correlates verv closeh with the 
..pathologic consequences of C. difficile disease ,n hamsters. The results presented in this 
Sample demonstrate tha, while anti-rccombinant toxin A (Interval 6, IgY is capable of onlv 
part.ally neutralizing the cytotoxic activity of C. difficile toxin A the same antibodv ' 
offec.n ely neutralizes 100% of the /„ rim endotoxin activity of the toxin. White",", is not 
."tended tha, this invention be limited to this mechanism, this may be due to the ototoxicity 
and entcrotoxici.y of r difficile Toxin A as two separate and distinct biological functions. " 

EXAMPLE 15 

In livn Neutralization Of C. Difficile Toxin A By 
Antibodies Against Recombinant Toxin A Polypeptides 

The ability of avian antibodies directed against the recombinant ( ' difficile toxin A 
fragment .870-2680 (as expressed by pMA 1870-2680: see Hxample !3> u> neutralize the 
cntcroioxic activity of toxin A was demonstrated in Example 14. The ability of avian 
antibodies (IgYs) directed against other recombinant toxin A epitopes to neutralize native 
«oxm A ,„ „•„, was next evaluated. This example involved: (a, the preparation of IgYs 
against recombinant toxin A polypeptides: (b, /„ v/vv, protection of hamsters auains, toxin A 
by treatment with anti-recombinant toxin A IgYs and (c) quantification of specific antibodv 
concentration in CTA and Interval 6 IgY PEG preparations. 

The nucleotide sequence of the coding region of the entire toxin A protein is listed in 
S!;Q ID NO:5. The amino acid sequence of the entire toxin A protein is listed in SI-Q ID 
NO.-6. The amino acid sequence consisting of amino acid residues 1X70 throuuh 2680 of 
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toxjn A is listed in SEQ ID NQ:7. The amino acid sequence consisting of amino acid 
residues 1870 through I960 of toxin A is listed in SFQ N> N():8. 

a) Preparation Of I R Y\s Ajjainst Recombinant Toxin a ' 
Polypeptides 

Eggs were collected from Leghorn hens which have been immunized with recombinant 
( Jif/kiU- toxin A rH>lypeptide fragments encompassing the entire toxin A protein. The 
polypeptide fragments used as immunogens were: I ) pMA 1870-2680 (Interval 6). 2) pPA 
1 100-1450 (Interval 4). and 3) a mixture of fragments consisting of pMA 30-300 (Interval I). 
pMA 300-660 (Interval 2). pMA 660-1100 (Interval 3) and pMA 1450-1870 (Interval 5)' 
This mixture of immunogens is referred to as Interval 1 235. The location of each interval 
xvithin the toxin A molecule is shown in Figure 15A. In Figure 15A. the following 
abbreviations arc used: pP refers to the pET23 vector (New England BioLabs): P M refers to 
the pMAI.^-c vector (New lingland BioLabs): A refers to; toxin A: the numbers refer u» the 
amino acid interval expressed in the clone. (For example, the designation P MA30-300 
indicates that the recombinant clone encodes amino acids 30-300 of toxin A and the vector 
used was pMAL IM -c). 

I he recombinant proteins were generated as described in Example II. I he IgYs were 
extracted and solubilized in O.IM carbonate buffer pi I 9.5 lor oral administration as described . 
m Example 14(a). The IgY reactivities against each individual recombinant interval was 
evaluated by LLISA as described in Lxamplc 13(c). 

I») In Vivo Protection Of Hamsters Against Toxin 

A By Treatment With Anti-ltecombinant Toxin A Antibodies 

1 he ability of antibodies raised against recombinant toxin A polypeptides to provide in 
viva protection against the enterotoxic activity of toxin A was examined in the hamster model 
system. This assay was performed as described in Example 14(b). Briclly. for each 40-50 
gram female Golden Syrian hamster (Charles River). 1 ml of IgY 4X (i.e.. resuspended in 1/4 
of the original yolk volume) PEG prep against Interval 6. Interval 4 or Interval 1235 was 
mixed wiih 30 ug (I.D„„, oral dose) off', difficile toxin A (lech Lab). Prcimmune IgY 
mixed with toxin A served as a negative control. Antibodies raised against C. difficile toxoid 
A (Example 8) mixed with toxin A (CTA) served as a positive control. The mixture was 
incubated lor I hour at 37°C then orally administered to lightly etherized hamsters using an 
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1 8G feeding needle. The animals were then observed for ihe onset of diarrhea and death for 
a period of approximately 24 hours. The results are shown in Table 18. 

TABLE 18 

Study Outcome After 24 Hours 



Treatment group 


Healthy* 


Diarrhea-' 


Dead* 


Preimmune 


0 


0 


7 


ITA 


s 


0 


0 


Interval ft 


ft 


1 


0 


Interval -1 


0 


1 


ft 


Interval 1235 


0 


(l 


7 



Animal shows ho sign of illness. 

Animal developed diarrhea, but did not die. 

Animal developed diarrhea and died. 

Pre-treaimcni of toxin A with IgYs against Interval, 6 prevented diarrhea in 6 of 7 
hamsters and completely prevented death in ail 7. In contrast, as with preimmune IgY. luYs 
against Interval 4 and Interval 1235 had no effect on the onset of diarrhea and death in the 
hamsters. 



c) Quantification Of Specific Antibody Concentration In CTA 
And Interval 6 IgY PEG Preparations 

To determine the purity of IgY PEG preparations, an aliquot of a pM A 1870-2680 
(Interval 6) IgY PEG preparation was chromatographed using MPEC and a KW-803 siting 
column (Miodcx-i. T he resulting profile of absorbancc at .280 nm is shown in f igure 16. I he 
single large "peak .corresponds to the predicted MW of IgY. Integration of the area under the 
single large peak showed that greater than 95% of the protein elutcd from the column was 
present in this single peak. This result demonstrated that the majority (>9S%) of the material 
absorbing at 280 nm in the PEG preparation corresponds to [gY. Therefore, absorbance at 
280 nm can be used to determine the total antibody concentration in PEG preparations. 

To determine the concentration of Interval 6-specillc antibodies (expressed as percent 
of total antibody) within the CTA and pMA1870-2680 (Interval 6) PEG preparations, defined 
quantities of these antibody preparations were affinity purified on a pPA 1870-2680(11) (shown 
schematically in figure 15B) affinity column and the specific antibodies were quantified. In 
f igure I5B the following abbreviations arc used: pP refers to the pET23 vector (New England 
BioLabs); pM refers to the pMAI/ M -c vector (New England BioLabs): pG refers to the pGEX 
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vector (Pharmacia): pB refers to the PinPoint'" Xa vector (Promega): A refers to toxin A the 
numbers refer to the amino acid interval expressed in the clone. The solid black ovals 
represent the MBP: the hatched ovals represent glutathione S-transferase: the hatched circles 
represent the biotin tag: and HI IH represents the poly-histidine tag. 

An affinity column containing recombinant toxin A repeat protein was made as 
follows. I our ml of PBS-washcd Actigel resin (Sterogene, was coupled with 5., 0 mg of 
pl»A.X70-2680 inclusion body protem (prepared as described in Example , ,7) and dialed 
•nto PBS, in a 15 m« tube (Falcon, containing ,„(> final volume A.d-coup.i„e solution ( I M 
-sod.um cyanoborohydride). Aliquot, of the supernatant from the couphng reactions before 
and after couplmg. were assessed by Coomassie staining of 7.5% SDS-PAGF gels Based 
upon protein hand intensities, greater than 6 mg of recombinant protein was coupled tothe 
resm. Ihe resin was poured into a 10 ml column (BioRad). washed extensiveiv with PBS 
pre-eluted with 4 M guanidine-HCI (in ,0 mM Tris-HCI. p.l 8.0: 0.005% thimerosal, and re- 
equilibrated with PBS. The column was stored at 4°C". 

Aliquots of a pMA 1 870-2680 {Interval 6, or a CTA IgY polyclonal ami bod v 
preparation (PFIG prep, were affinity purillcd on the above alllni.y column as follows The 
column was attached to an UV moniior (ISCO) and washed wi,h PBS. , or P MAI870-,68(, 
i^Y puntlcation. a 2X PEG prep (11 Iter sterilized using a 0.45 p Hher: approximated 500 me 
total IgY, was applied. The column was washed with PBS until ,he baseline was re- 
established ( the column How-through was saved), washed will, BBSTween to elute 
nonspecillcally binding antibodies and re-equilibrated with PBS. |, ound antlboov W;IS ^ 
.rorn the column in 4 M guanidine-l ICI (in 10 mM I ris-. Id. P H 8.0: (,.005% th.merosal, 
I he entire elution peak was collected in a I 5 ml lube , Falcon The column uas re- 
cquilibraled and the column eluate was re-chromatographed as described above. The antibodv 
preparation was quantified by l. : V absorbance (the elution buffer was used to zero the 
spectrophotometer). Total purified antibodv was approximately 0 ,„g and « mu from the firs, 
and second chromatography passes, respectively. The low yield from the second pass 
indicated tha, most spec.fic antibodies were removed by the llrst round of chromatography 
The estimated percentage of Interval 6 specific antibodies in ,he pMA 1 870-26X0 PF.G prep is 
approximately 2%. 

The percentage of Interval 6 specific antibodies in the C TA PF.G prep was determined 
(ut.hzmg the same column and methodology described above, »o be approx.matciv 0 5% of 
total li-Y. 
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A 4X PEG prep contains approximately 20 mg/ml la Y. Thus in b) above, 
approximately 400 M g specific antibody in the Interval 6 PEG prep neutralized 30 ug toxin A 



in vivo. 



EXAMPLE 16 

In Vivo Treatment Of C. difficile Disease In 
Hamsters By Recombinant Interval 6 Antibodies 

The ability „f antibodies directed against recombinant Interval 6 of toxin A to protect 
hamsters in vivo from ( '. difficile disease was examined. T his example involved: (a) 
prophylactic- treatment of ( difficile disease and (b) therapeutic treatment of ( " difficile 
disease. 

a) Prophylactic Treatment Of C. difficile Disease 

I his experiment was performed as described in Example «(b>. Three groups each 
consisting of 7 female 100 gram Syrian hamsters (Charles River l were prophyluctically treated 
uith either prcimmune IgYs. IgYs against native toxin A and B fCTAB: see Example K (a) 
and (b)| or IgYs against Interval 6. IgYs were prepared as 4X PEG preparations as described 
in Example 9(a). 

The animals were orally dosed 3 times daily, roughly at 4 hour intervals, for 12 days 
with 1 ml antibody preparations diluted in Ensure*. Using estimates of specific antibody 
concentration from Example 15(c). each dose of the Interval 6 antibody prep contained 
approximately 400 M g of specific antibody. On day 2 each hamster was predisposed to C.V 
difficile infection by the oral administration of 3.0 mg of C lindamycin-I K l (Sigma) in I ml 
of water. On day 3 the hamsters xvere orally challenged with I ml of C. difficile inoculum 
strain ATCC 43596 in sterile saline containing approximately 100 organisms. The animals 
uere then observed for the onset of diarrhea and subsequent death during the treatment 
period. The results are shown in Table 1 9. 
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TABLE 19 



Treatment Group 


Number Animals Alive 


Number Animals Dead 


Prcimmunc 


0 


7 


CTAU 


6 


1 


Interval 6 


7 


0 



10 



15 



20 



Treatment of hamsters with orally-administered lg Ys against Interval 6 successfully 
protected 7 out or 7 ( 1 00%) of the animals from < difficile disease. One of the hamsters in 
this group presented with diarrhea which subsequently resolved during the course of 
treatment. As shown previously in Example 9. antibodies to native toxin A and toxin B were 
highly protective. In this Example. .6 out of 7 animals survived in the C TAB treatment group. 
All of the hamsters treated with prcimmune sera came down with diarrhea and died. The 
survivors. i„ both the (.TAB and Interval 6 groups remained healthy throughout a 12 day post- 
treatment period. In particular. 6 out of 7 Interval 6-trcatcd hamsters survived at least 2 
weeks after termination of treatment which suggests that these antibodies provide a long- 
lasting cure. These results represent the first demonstration that antibodies generated against a 
recombinant region of toxin A can prevent CIMD when administered passivch to animals. 
These results also indicate that antibodies raised against Interval 6 alone may he sufficient to 
protect animals from t \ difficile disease when administered prophylactically. 

Previously others had raised antibodies against toxin A by actively immunizing 
hamsters against a recombinant polypeptide located within the Interval 6 region |l.yerly. 
D.M.. a ul. (1990) Curr. Microbiol. 21:291.. Figure 17 shows schematically the location of 
the l.yerly. « ul. inira-lntcrval 6 recombinant protein (cloned into the pUC vector) in 
comparison with the complete Interval 6 construct (pMA 1 870-2680) used herein to generate 
neutralizing antibodies directed against toxin A. in Figure 1 7. the solid black oval represents 
the MBP which is fused to the toxin A Interval 6 in pMA 1 870-2680. 

The Lyerly. a <,/. antibodies (intra-lnterval 6) were only able to partially protect 
hamsters against C. difficile infection in terms of survival (4 out of 8 animals survived) and 
furthermore, these antibodies did not prevent diarrhea in any of the animals. Additionally, 
animals treated with the intra-lntcrvai 6 antibodies [I.ycrly. cl al (1990). .„,,„-„] died when 
treatment was removed. 

In contrast, the experiment shown above demonstrates that passive administration of 
anti-Interval 6 antibodies prevented diarrhea in 6 out of 7 animals and completely prevented 
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death due to CDAD. Furthermore, as discussed above, passive administration of the anti- 
Interval 6 antibodies provides a long lasting cure (i.e.. treatment could be withdrawn without 
incident >. 



b) Therapeutic Treatment Of C difficile Disease: In Vi vo 
Treatment Of An Established C. difficile Infection In 
Hamsters With Recombinant Interval 6 Antibodies 

The ability of antibodies against recombinant interval 6 of toxin A to therapeutically 
treat ( \ difficile disease was examined. The experiment was performed essentially as 
described in Example 10(b). Three groups, each containing seven to eight female Golden 
Syrian hamsters ( 100 g each: Charles River) were treated with either preimmune IgY. IgYs 
. against native toxin A and toxin B (CTAB) and IgYs against Interval 6. The antibodies were 
prepared as described above as 4X PF.Ci preparations. 

The hamsters were ilrst predisposed to C. difficile infection with a 3 nig dose of 
C'lindamycin-HC'l (Sigma) administered orally in I ml of water. Approximately 24 hrs later, 
the animals were orally challenged with I ml olC difficile strain .ATCC* 4359ft in sterile 
saline containing approximately 200 organisms. One day after infection, the presence of toxin 
A and B was determined in the feces ot the hamsters using a commercial immunoassay kit 
(Cytoclonc A+B !£PA. C ambridge Biotech) to verify establishment of infection. I our 
members of each group were randomly selected and tested. Feces from an uninfected hamster 
was tested as a negative control. All infected animals tested positive lor the presence of toxin 
according to the manufacturer s procedure. The initiation of treatment then started 
approximately 2-4 hr post-infection. 

The animals were dosed daily at roughly 4 hr intervals with I ml antibody preparation 
diluted in l-nsurcK: (Ross Labs). The amount of specific antibodies given per dose 
(determined by affinity purification) was estimated to be about 400 ug of anti-Interval 6 IgY 
dor animals in the Interval 6 group) and 100 ug and 70 ug of anti-toxin A (Interval 6- 
specifk) and anti-toxin B (Interval 3-spccific: see Example 19). respectively, for the CTAB 
preparation. The animals were treated for 9 days and then observed for an additional 4 davs 
lor the presence of diarrhea and death. The results indicating the number of survivors and the 
number of dead 4 days post- in fee lion are shown in Table 20. 
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TABLE 20 



Trcaimenc Group 


Number Animals Alive 


Number Animals Dead 


Prcimmtine 


4 


3 


CTAB 


8 


i) 


Interval 6 


8 


(» 



Antibodies directed against both Interval 6 and CTAB successfully prevented death 
from ( \ difficile when therapeutically administered 24 hr alter infection. This result is 
significant since many investigators begin therapeutic treatment of hamsters with existing 
drugs (c-.y.. vancomycin, phenelfamycins. tiacumicins. etc.) X hr post-infection [Swanson. vt 
ol. (I9"l) Antimicrobial Agents and Chemotherapy 35:1108 and (1989) J. Antibiotics 42:94| 

lorty-lwo percent of hamsters treated with preimmune IgY died from (.'DAD. While 
the ami-Interval 6 antibodies prevented death in the treated hamsters, they did not eliminate 
all symptoms of CD AD as 3 animals presented with slight diarrhea. In addition, one CTAB-' 
treated and one preimmunc-trealed animal also had diarrhea 14 days rust-infection. These 
results indicate that anti-Interval 6 antibodies provide an effective means of therapy for 
CDAl). 

EXAMPLE 17 

Induction Of Toxin A Neutralizing Antibodies Requires Soluble Interval 6 Protein 

As shown in [ixamples I 1(d) and 15. expression of recombinant proteins in E. volt 
may result in the production of either soluble or insoluble protein. If insoluble protein is 
produced, the recombinant protein is solubilizcd prior to 'immunization of animals. To 
determine whether, one or both of the soluble or insoluble recombinant proteins could be used 
to generate neutralizing antibodies to toxin A. the following experiment was performed This 
example involved a) expression of the toxin A repeats and subfragmenls of these repeals in II. 
cali using a variety of expression vectors: b) identification of recombinant toxin A repeats and 
sub-regions to which neutralizing antibodies bind: and c) determination of the neutralization 
ability of antibodies raised against soluble and insoluble toxin A repeat immunogen. 
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a) Expression Of The Toxin A Repeats And Stibfragments Of 
These Repeats In coli Using A Variety Of Expression 
Vectors i 

The Interval 6 immunogen utilized in Examples 1 5 and 1 6 was the pMA 1 870-9680 
protein, in which the toxin A repeats arc expressed as a soluble fusion protein with the MBP 
(desenbed in Sample II). Interestingly, expression of this region ( from the Spcl site to the 
end o. the repeats, see Figure 1 5B) in three other expression constructs, as either native 
<pPAI870-268<». poly-,!i s tagged fpPA 1 870-2680 (I I)) or biotin-tagged (pBA1870-^80) 
protens resulted in completely insoluble protein upon induction of the bacterial host (see 
F.gure ,5B, The host strain BI.2I (Novagcn) was used lor expression of pBA 1 870-2680 and 
»os, strain BI.21(DF,3> (Novagcn, was used for expression of pPA. 870-2680 and pPA«870- 
2680,1.,. These insoluble proteins accumulated to high levels in inclusion bodies I Expression 
o» recombinant plasmids in /• < „/, hos, cells grown in 2X YT medium was performed as 
described | Williams, el a/. (1995 >. supra\. 

As summarized in Figure I5B. expression of fragments of the toxin A repeats (as 
Cher N-tcrmina. .S/,,I-/^,R| fragments, or C-terminal J?„,RI-cnd fragments, also yielded 
high levels of insoluble protein using pGliX (p(i A 1 870-2 190). PinPoin«™-Xa (pB A 1870-2190 
uml PBA2250-2680, and pliT expression systems (pPA 1 870-2190). The pGlIX and pET 
expression systems are described in Example II. The PinPoin.«"-Xa expression svstem drives 
.he expression of fusion proteins in E. coli. f usion proteins from PinlW»-Xa vectors 
contain a biotin tag a, the amino-.erminal end and can be affinity purified Sofil.mk'*' Soft 
Release avidin resin ( Promega, under mild denaturing conditions ( 5 mM biotin). 

The solubility of expressed proteins from the pPG 1870-2 190 and pPAI870-2190 
expression construes was determined after induction of recombinant protein expression under 
condumns reported to enhance protein so.ubi.ity fThese conditions comprise .rowth of the 
host a, reduced temperature (30°C) and the utilization of high (I mM IPTG) or low (0.. mM 
IPFCi, concentrations of inducer [Williams a a,. (1995). .v,,,,,,,,. All expressed recombinant 
•«x.n A protein was insoluble under these conditions. Thus, expression of these .Vauments of 
U« toxm A repeats in plZT and pGHX expression vectors resu.ts in the production of inso.ub.e 
recombman, protein even when the hos, cells are grown at reduced temperature and usinu 
lower concentrations of the inducer. Although expression of these fragments in pMal vectors 
> .elded affinity purillable soluble fusion protein, the protein was either predominant 
"..soluble (pMA.870-2190) or unstable ( P MA2250-2650). Attempts ,o so.ubilize expressed 
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protein from the pMA 1 870-2 1 90 expression construct using reduced temperature or lower 
inducer concentration (as described above) did not improve fusion protein solubility. 

Collectively, these results demonstrate that expression of the toxin A repeat region in 
E. coti results in the production of insoluble recombinant protein, when Expressed as either 
large (aa 1870-2680) or small (aa 1870-2 190 or aa 2250-2680) fragments, in a variety of 
expression vectors (native or poly-his tagged pET. pGLX or PinPoint ,M - Xa vectors), utilizing 
growth conditions shown to enhance protein solubility. The exception to this rule were 
fusions with the MB I*, which enhanced protein solubility, either partially (pMA 1870-2 1 90) or 
fully ( pMA 1 870-2680). 



h) Identification Of Recombinant Toxin A Repeats And Sub- 
Regions To Which Neutralizing Antibodies Bind 

Toxin A repeat regions to which neutralizing antibodies bind were identified by 
utilizing recombinant toxin A repeat region proteins expressed as soluble or insoluble proteins 

1> to deplete protective antibodies from a polyclonal pool of antibodies against native C difficile 
toxin A. An //; vivo assay was developed lo evaluate proteins for the abilitv to bind 
neutralizing antibodies. 

I he rational for this assay is as follows. Recombinant proteins were first pre-mixed 
with antibodies against native toxin A (CTA antibody: generated in Example X) and allowed 

20 to react. Subsequently. C. difficile toxin A was added at a concentration lethal to hamsters 
and the mixture was administered lo hamsters via IP injection. If the recombinant protein 
contains neutralizing- epitopes, the CTA antibodies would lose their ability to bind toxin A 
resulting in diarrhea and/or death of the hamsters. 

file assay was performed as follows. The lethal dose of toxin A when delivered oraljy 

2:> to nine 40 to 50 g Golden Syrian hamsters (Sasco) was determined to be 10 to 30 ng. The 

I'FXj-purhlcd CTA antibody preparation was diluted to 0.5X concentration H e . the antibodies 
were diluted at twice the original yolk volume) in 0.1 M carbonate buffer. pH 9.5. The 
antibodies were diluted in carbonate buffer to protect them from acid degradation in the 
stomach. The concentration of 0.5X was used because it was found lo be the lowest effective 

30 concentration against toxin A. The concentration of Interval 6-spccillc antibodies in the 0.5X 
CTA prep was estimated to be 10-15 Mg/mi (estimated using the method described in 
Lxample 15). 
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The inclusion body preparation [insoluble Interval 6 protein: pPA 1 870-2680(H)J and 
the soluble Interval 6 protein [pMA 1 870-2680: see Figure I5| were both compared for their 
ability to bind to neutralizing antibodies against C difficile toxin A (CTA). Specifically. I to 
2 mg ol recombinant protein was mixed with 5 ml of a 0 : 5X CTA antibody prep (estimated 
to contain 60-70 U g of Interval 6-specilk antibody). Alter incubation for I hr at 37°C. CTA 
(Tech l.ab) at a final concentration of 30 ug/ml was added and incubated for another I hr at 
37°C. One ml ol this mixture containing 30 ug of toxin A (and 10-15 ug of Interval 6- 
spccillc antibody) xvas administered orally to 40-50 g Ciolden Syrian hamsters (Sascoj. 
Recombinant proteins that result in the loss of neutralizing capacity of the CTA antibody 
would indicate that those proteins contain neutralizing epitopes. Preimmune and CTA 
antibodies (both at 0.5X) without the addition or any recombinant protein served as negative 
and positive controls, respectively. 

Two other inclusion body preparations, both expressed as insoluble products in the 
pCT vector, were tested: one containing a different insert (toxin R fragment) other than 
Interval 6 called pPBl 850-2070 (see Figure IK) -which serves as a control for insoluble 
Interval 6. the other was a truncated version of the Interval 6 region called pI'A 1 870-2 190 
(see Figure I5M). The results of this experiment arc shown in Table 21. 

TABLE 21 



Treatment Group 1 


Healthv : 


Diarrhea" 


/Mier j-# nours 
Dead 4 


i Yam nui no Ah 


0 




-> 


CTA Ab 


4 


1 


0 


CTA Ah - Im h (soluble) 


1 


"> 




CTA Ab - Im 6 (insoluble) 


s 


0 


0 


CTA Ah + pPB 1850-2070 


5 


(1 


0 


CTA Ah ^ pPA 1870-2 190 


5 


0 


0 



' < '. ilillkilv toxin A (CTA) was added to each group. 

Animals showed no signs of illness. 
Animals developed diarrhea bill did not die. 
Animals developed diarrhea and died. 

I'reimmune antibody was ineffective against toxin A. while ami-CTA antibodies at a 
dilute 0.5X concentration almost completely protected the hamsters against the emerotoxic 
cl feels of CTA. The addition of recombinant proteins pPB 1 850-2070 or pl'AI870-2!90 to 
the anti-CTA antibody had no effect upon its proteclive ability, indicating that these 
recombinant proteins do not bind to neutralizing antibodies. ()„ the other hand, recombinant 
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Interval 6 protein was able to bind to neutralizing anti-CTA antibodies and neutralized the in 
vivo protective effcet of the anti-CTA antibodies. Four out of five animals in the group 
treated with anti-CTA antibodies mixed with soluble Interval 6 protein exhibited toxin 
associated toxicity (diarrhea and death) Moreover, the results showed that Interval 6 protein 
must be expressed as a soluble product in order for it to bind to neutralizing anti-CTA 
anybodies since the addition of insoluble Interval 6 protein had no effect on the ncutralizim; 
capacity of the CTA antibody prep. 

c) Determination Of Neutralization Ability Of Antibodies 
Raised Against Soluble And Insoluble Toxin A Repeat 
Immunogen 

To determine if neutralizing antibodies are induced against solubilized inclusion 
bodies, insoluble toxin A repeat protein was solubilized and specific- antibodies were raised in 
chickens. Insoluble pl'A1870-2680 protein was solubilized using the method described in 
Williams .-/ a/. ( 1995). supra. Briefly, induced cultures (500 ml) were pelleted by 
centri.ugation a, 3.000 X g for 10 min a. 4°C. The cell pellets were resuspended thoroughly 
in 10 ml of inclusion body sonication buffer (25 mM HFPES P H 7.7. 100 mM KCI. 12 5 mM 
MgCI.. 20% glycerol. 0.1% (v/v> Nonidct P-40. 1 mM DTD. The suspension was transferred 
to a 30 ml non-glass centrifuge tube, Five hundred ul of 10 mg/m| lysozyme was added and 
the tubes were incubated on ice for 30 min. The suspension was then -frozen at -70°C for at 
least I hr. The suspension was tliawcd rapidly in a water bath at room temperature and then 
placed on ice. The suspension was then sonicated using at least eight 15 sec bursts of the 
mieroprobe (Branson Sonicator Model No. 450) with intermittent cooling on ice. 

The sonicated suspension was transferred to a 35 ml Oakridgc tube and ccmriluged at 
6.000 X g |«r 10 min at 4°C to pellet the inclusion bodies. The pellet was washed 2 times by 
pipetting or vortexing in fresh, ice-cold R.IPA buffer |0.1% SOS. 1% Triton X-100. 1% 
sodium deoxycholate in TBS (25 mM Tris-CI pi I 7.5. 150 mM NaCI)|. The inclusion bodies 
were recentrifuged after each wash. The inclusion bodies were dried and transferred using a 
small metal spatula to a 15 ml tube (Falcon). One ml of 10% SDS was added and the pellet 
was solubilized by gently pipetting the solution up and down using a 1 ml micropipets. The 
solubilization was facilitated by heating the sample to 95°C when necessary. 

Once the inclusion bodies were in solution, the samples were diluted with 9 volumes 
of PBS. The protein solutions were dialyzed overnight against a 100-fold volume of PBS 
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containing 0:05% SDS at room temperature. The dialysis buffer was then changed to PBS 
containing 0.01% SDS and the samples were dialvzed for several hours to overnight at room 
temperature. The samples were stored at 4°C until used. Prior to further use. the samples 
were warmed to room temperature to allow any precipitated SDS to go back into solution. 

The inclusion body preparation was used to immunize hens. The protein was dialyzed 
into PUS and emulsified with approximately equal volumes of CPA for the initial 
immunization or 1 1- A lor subsequent booster immunizations. On day zero, for each of the 
recombinant recombinant preparations, two egg laying white Leghorn hens were each injected 
a. multiple sites <IM and SO with I ml of recombinant protein-adjuvant mixture containing 
approximately 0.5- 1 .5 mg of recombinant protein. Booster immunizations of 1 .0 mg were 
given of days 14 and day 28. Kggs.wcre collected on day 32 and the antibody isolated using 
,M£G aS dusc "k*» i". Example 14(a). High liters of toxin A specific antibodies were present 
(as assayed by CUSA. using the method described in Example -13). Titers were determined 
for both aniibodies against recombinant polypeptides pPA 1870-2680 and pMA 1870-2680 and 
were found to Ik- comparable at > 1:62.500. 

Aniibodies against soluble Interval 6 (pMA 1 870-2680) and insoluble Interval 6 
Kinclusion body). pl'A1870-2680| were tested for neutralizing ability against toxin A using 
the in vivo assay described in Example 15(b). Preimmune antibodies and antibodies against 
toxin A (C TA) served as negative and positive controls, respectively. The results are shown 
in Table 22. 

TABLE 22 



Antibody Treatment Oroup 


Healtln' 


. -r . . .1UIU V Willi 

Diarrhea" 


imijc /Miei i tours 
Dead 1 


rrcinimiinc 


1 


0 


1 


CTA 


5 


0 


(1 


Interval 6 (Soluble) 1 


5 


(J 


0 


Interval 6 (Insoluble) | 


0 







' Animals showed no si«n of illness. 

Animal developed diarrhea but did not die. 
Animal developed diarrhea and died. 
■iOO iii: ml. 

Antibodies raised against native toxin A were protective while preimmune antibodies 
had little effect. As found using the in vitro C HO assay [described in Example 8(d)} where 
antibodies raised against the soluble Interval 6 could partially neutralize the effects of loxin A. 
here they were abie to completely neutralize toxin A in viva. In contrast, the antibodies 
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raised against the insoluble Interval 6 was unabje to neutralize the effects of toxin A in vivo 
as shown above (Table 22) and in vitro as shown in the CHO assav [described in Example 
8««>I | 

These results demonstrate that soluble toxin A repeat immunogen is necessary to 
induce the production of neutralizing antibodies in chickens, and thai the generation of such 
soluble immunogen is obtained only with a specific expression vector (pMal) containing the 
toxin A region spanning aa 1870-2680. That is ,o say. insoluble protein that* is subsequently 
xolubilized does not result in a toxin A antigen that will elicit a neutralizing antibody. 

EXAMPLE 18 

t 

Cloning And Expression Of The ( \ difficile Toxin B Gene 

t he toxin B gene has been cloned and sequenced: the amino acid sequence deduced 
from the cloned nucleotide sequence predicts a MW of 269.7 kD lor toxin B |Barroso ei ai . 
Nucl. Acids Res. IX:40()4 (1990)}. The nucleotide sequence of the coding region of the entire 
toxin B gene is listed in SEQ ID NO:9. The amino acid sequence of the entire toxin B 
protein is listed in SEQ ID NO: 10. The amino acid sequence consisting of amino acid 
residues 1X50 through 2360 of toxin B is listed in SEQ ID NO: 11. The ammo acid sequence 
consisting of amino acid residues 1750 through 2360 of toxin B is listed in SI-Q ID NO: 1 2. 

Given the expense and difficulty of isolating native toxin B protein, it would be 
advantageous to use simple and inexpensive procaryotic expression systems to produce and 
purify high levels of recombinant toxin B protein for immunization purposes. Ideally, the 
isolated recombinant protein would be soluble in order to preserve native antigenicity, since 
solubilizcd inclusion body proteins often do not fold into native conformations. Indeed as 
shown in Example 17. neutralizing antibodies against recombinant toxin A were only obtained 
when soluble recombinant toxin A polypeptides were used as the immunogen. To allow ease 
of purification, the recombinant protein should be expressed to lewis greater than I mg/Iiter 
'ol E. cofi culture. 

To determine whether high levels of recombinant toxin B protein could be produced in 
/:". colt, fragments of the toxin B gene were cloned into various prokaryoiic expression 
vectors, and assessed for the ability to express recombinant toxin B protein in E. coli. I his 
Example involved (a) cloning of the toxin B gene and (b) expression of the toxin B gene in 
/.'. coli. 
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a) Cloning Of The Toxin B Gene 

The toxin B gene was cloned using PCR amplification from C. difficile genomic DNA. 
Initially, the gene was cloned in two overlapping fragments, using pr'imer pairs P5/P6 and 
P7/P8. The location of these primers along the toxin 3 gene is shown schematically in Figure 
IK. The sequence of each of these primers is: 1*5: 5" TAGAAAAAATGGCAAATGT 3" (SEQ 
ID NO: I I ): P6. 5" TTTCATCTTGTA GAGTCAAAG .V (SEQ ID NO:l2): 
P7: 5" GATGCCACAAGATGATTTAGTG 3" (SEQ ID NO: 1 3): and 
P8: 5" CTAA TTGAGCTGTATCAGGATC 3' (SEQ ID NO: 1 4). 

f igure 18 also shows the location of the following primers along the toxin B gene: P9 
which consists of the sequence 5" CGCAATTCCTAGAAAAAATGGCAA A I G ,V (SEQ ID 
NO: 1 5): IMO which consists of the sequence 5" GCTC.'TAGAATGA CCATAAGCTA'GCCA 
3 (SEQ ID NO: 1 6): PI I which consists of the sequence 

-V ( GCiAA n CGAGTTGGTAGAAAGGTGGA .V (SEQ ID NO: 1 7): PI3 which consists of 
the sequence 5" CGGAATTCGGTTATTATCTTAAGGATG 3" (SEQ ID NO: 18): and P 14 
15 which consists of the sequence 5" CGGAATTCTTGATAACTGGAT TTGTGAC .V (SEQ ID 
NO: 19). I lie amino acid sequence consisting of amino acid residues 1852 through 2362 of 
toxin li is listed in SEQ ID NO:20. The amino acid sequence consisting of amino acid 
residues 1-755 through 2362 of toxin H is listed in SEQ ID NO:21. 

Clostridium difficile VPI strain 10463 was obtained from the American Type Culture 
20 C ollection (ATC'C 43255) and grown under anaerobic conditions in brain-heart infusion 

medium (Becton Dickinson). High molecular-weight C. difficile DNA was isolated essentially 
as described [Wren and Tabaqchali (1087) J. Clin. Microbiol.. 25:2402 1. except 1) 100 ug/ml 
proteinase K in 0.5% SDS was used to disrupt the bacteria and 2) eeiytrimeihylammonium 
bromide (( TAB) precipitation |as described by Ausubel et a!.. Eds.. < uneni Protocols in 
Molecular liiolo K v. Vol. 2 (1989) Current Protocols! was used to remove carbohydrates from 
the cleared lysate. Briclly. after disruption of the bacteria with proteinase K. and SDS. the 
solution is adjusted to approximately 0.7 M NaCI by the addition ol a 1/7 volume of 5M 
NaCI. A 1/10 volume of CTAB/NaCI (10% C'I'AB in 0.7 M NaCI) solution was added and 
the solution was mixed thoroughly and incubated 10 min at 65°C. An equal volume of 
chloroform/isoamyl alcohol (24:1 ) was added and the phases were thoroughly mixed. The 
organic and aqueous phases were separated by centrifugation in a microliige for 5 min. The 
aqueous supernatant was removed and extracted with phenol/chloroform/ isoamyl alcohol 
(25:24:1 ). The phases were separated by centrifugation in a micro/uge for 5 min. The 
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supernatant was transferred to a fresh tube and the DNA was precipitated with isopropanol 
The DNA precipitate was pelleted by brief ccntrifugation in a microfugc,' The DNA pellet 
was washed with 70% ethanol ,« remove residual CTA 13. The DNA pellet was then dried 
and redissolvcd in TK buffer ( 10 n ,M Tris-HCI P H8.0. I ,„M HDTA). The i„, CU ritv and 
yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA alter electrophoresis on an agarose gel. 

Toxin B fragments were cloned by PCR utilizing a proofreading thermostable DNA 
polymerase (native /> polymerase (Stratagene,|. The high fidelity of ,h is poJvmcra.se 
reduces the mutation probleu 1S associated with amplication by error prone polvmcrascs <„ * 
T« q polymerase,. PCR amplification was performed using the PCR primer pairs P5 (SI Q ID 
NO:l I , with Pf, ( STQ ID NO:12> and P7 (SKQ ID NO: 13) with P8 (SHO ID NO:l4, in 50 
nl reactions containing 10 „,M Tris-HCI pH8.3. 50 mM KCI. 1.5 mM MgCI.. 200 uM of 
each dNTP. 0.2 uM each primer, and 50 ng C. difficile genomic DNA. Reactions were 
overlaid with 100 ul mineral oil. heated to 94°C for 4 min. 0.5ul native Pfi, polymerase 
(S.ratayene) uas added, and the reactions were cycled 30 times a. 94°C for 1 min. 50°C for I 
min. 72°C <2 min for each kb of sequence to be amplified), followed by 10 min at 72°C. 
Duplicate reactions were pooled, chloroform extracted, and ethanol precipitated: After 
washing in 70% ethanol. the pellets were ^suspended in 50 „| Ti; buffer ( 10 mM Tris-IICI 
pH8.0. 1 mM FiDTA). 

The P5/P6 amplification product uas cloned into pUCI 9 as outlined below. 10ul 
aliquots of DNA were gel puri lied using the Prep-a-Gene kit (BioRad). and ligated to .S>„„| 
restricted P UCI9 vector. Recombinant clones were isolated and confirmed by restriction 
digestion using standard recombinant molecular biology techniques (Sambrook <•/ <,/.. 1989). 
Inserts from two independent isolates were identified in which the toxin B insert was oriented 
such that the vector HamlU and Sad sites were 5" and .V oriented, respectively (pUCBlO- 
1530). The insert-containing DamHVSacI fragment was cloned into similarly cut pl-T23a-c 
vector DNA. and protein expression was induced in small scale cultures (5 ml) of identified 



I ones. 



Total protein extracts were isolated, resolved on SDS-PACili gels, and toxin B protein 
identified by Western analysis utilizing a goal anti-toxin B affinity purified antibody (Tech 
Lab). Procedures for protein induction. SDS-PAGt. and Western blot analysis were 
performed as described in Williams c at. (1995). x , w „. |„ brief. 5 ml cultures of bacteria 
grown in 2XYT containing 100 M g/ml ampicillin containing the appropriate recombinant clone 
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were induced to express recombinant protein by addition ot lPTG to ImM. The t\ coli hosts 
used were: BL2UDE3) or BL21(DE3)LysS (Novagen) for pET plasmids. 

Cultures were induced by the addition of IPTG to a final concentration of 1 .0 mM 
when the cell density reached 0.5 OD MK1 . and induced protein was allowed to accumulate for 
two his after induction. Protein samples were prepared by pelleting I mi aliquots of bacteria 
by centrifugation ( I min in microfuge). and resuspension of the pelleted bacteria in 150 ul of 
2X SDS-PAGE sample buffer (0.125 mM Tris-HCl pH 6.8. 2 mM EDTA. 6% SDS. 20% 
glycerol. 0.025% bromophenol blue: l>mercaptoethanol is added to 5% before use). The 
samples were heated to «5°C for 5 min. then cooled and 5 or 10 M ls loaded on 7.5% SDS- 
PAGE gels. I ligh molecular weight protein markers (BioRad) were also loaded, to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected either generally by staining the gels with Coomassie Blue, or specifically, by blotting 
to nitrocellulose for Western blot detection of specific immunoreactive protein. The MW of 
induced toxin B reactive protein allowed the integrity of the toxin B reading frame to be 
determined. 

The pii 123b recombinant (pPB 1 0- 1 530) expressed high MW recombinant toxin B 
reactive protein, consistent with predicted values. This confirmed that Iramc terminating 
errors had not occurred during PGR amplification. A P ET23b expression clone containing the 
IO-1750aa interval of the toxin B gene was constructed, by fusion of the £coRV-S/k>1 
fragment ol the P7/P8 amplification product to the P5-£t-«RV interval of the P5/P6 
amplification product (see figure 18) in pPBIO-1530. The integrity of this clone (pPBJO- 
1750) was confirmed by restriction mapping, and Western blot detection of expressed 
recombinant toxin B protein. Levels of induced protein from both pPBIO-1530 and pPBIO- 
1 750 were loo low to facilitate purification of usable amounts of recombinant toxin B protein. 
The remaining 1 750-2360 aa interval was directly cloned into pMAL or P KT expression 
vectors from the P7/P8 amplification product as described below. 

H) Expression Of The Toxin II Cienc 

i) Overview Of Expression Methodologies 

fragments of the toxin B gene were expressed as either native or fusion proteins in E. 
coli. Native proteins were expressed in either the p£T23a-e or pETI6h expression vectors 
(Novagen). The pET23 vectors contain an extensive polylinker sequence in all three reading 
frames (a-c vectors) followed by a C-terminal poly-histidine repeat. The pl-T16h vector 
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contains a N-terminal poly-histidine sequence immediately 5' to a small polylinker. The 
poly-histidine sequence binds to Ni-Chelate columns and allows affinity purification of tagged 
target proteins [Williams et a/. (1995). supra]. These affinity tags are srnall (10 aa for 
P ETI6b. 6 aa for P ET23) allowing the expression and affinity purification of native proteins 
with only limited amounts of foreign sequences. 

An N-terminal histidine-tagged derivative of P ET 1 6b containing an extensive cloning 
cassette was constructed to facilitate cloning of N-terminal poly-histidine tagged toxin B 
expressing constructs. This was accomplished by replacement of the promoter region of the 
pl-T23a and b vectors with that of the pET16b expression vector. Each vector was restricted 
with B K I\\ and Ndel. and the reactions resolved on a 1.2 % agarose gcl. The P ETl6b 
promoter region (contained in a 200 bp ^/IF-AV/d fragment) and the promoter-less pET23 a 
or b vectors were cut from the gel. mixed and Prep- A-Ciene (BioRad, purified. The eluted 
DNA was ligated. and transformants screened for promoter replacement by AV«I digestion of 
purified plasmid DNA (the pETI6b promoter contains this site, the pET23 promoter does 
not,. These clones (denoted pETHisa or b) contain the p'ET 1 6b promoter (consisting of a 
P T7-lac promoter, ribosome binding site and poly-histidine ( lOaa) sequence, (used at the Nde\ 
site to the extensive pET23 polylinker. 

All MBP fusion proteins were constructed and expressed in the pMAI. ,M -c or 
pMAI.'>'-p2 vectors (New England Biolabs) in which the protein of interest is expressed as a 
C-terminal fusion with MB P. All pET plasmids were expressed in either the B1.21(DE3> or 
BL2l(I)E3,|.ysS expression hosts, while pMal plasmids were expressed in the BI.2I host. 

l arge scale (500 mis to 1 liter, cultures of each recombinant were grown in 2X Y l 
broth, induced, and soluble protein fractions were isolated as described | Williams, et al. 
(1995). supra]. The soluble protein extracts were affinity chromatographed to isolate 
recombinant fusion protein, as described [Williams e , al.. (1995) supra]. In brief, extracts 
containing tagged pET fusions were chromatographed on a nickel chelate column, and eluted 
using imidazole salts or low pi I (pll 4.0) as described by the distributor (Novaucn or Qiagen). 
Extracts containing soluble pMAl. fusion protein were prepared and chromatographed in PBS 
buffer oxer an amylose resin (New England Biolabs, column, and eluted with PBS containing 
10 mM maltose as described | Williams ei al. (1995). supra]. 
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ii) Overview Of Toxin B Expression 

In both large expression constructs described in (a) above, only low level </.<•. less 
, than I mg/liter of intact or nondegraded recombinant protein) expression of recombinant 
protein was detected. A number of expression constructs containing smaller fragments of the 
toxin B gene were then constructed, to determine if small regions of the gene can be 
expressed to high levels (i.e.. greater than I mg/liter intact protein) without extensive protein 
degradation. All were constructed by in frame fusions of convenient toxin B restriction 
fragments to either the P MAL-c. pLT23a-c. pLTI6b or pCTHisa-b expression vectors, or by 
engineering restriction sites at specific locations using PCR amplification (using the same 
conditions described in (a) above |. In all cases, clones were verified by restriction mapping, 
and. where indicated. DNA sequencing. 

Protein preparations from induced cultures of each of these constructs were analyzed, 
by SDS-l»A(il-:. to estimate protein stability (Coomassie Blue staining) and immunoreactivny 
against anti-toxin B specilic antiserum (Western analysis). Higher levels of intact (i.e.. 
nondegraded). full length fusion proteins were observed with the smaller constructs as 
compared with the larger recombinants, and a series of expression constructs spanning the 
entire toxin B gene were constructed (figures 18. 19 and 20 and Table 23). 

C onstructs that expressed significant levels of recombinant toxin B protein (greater 
than I mg/liter intact recombinant protein) in E. cult were identified and a series of these 
clones that spans the toxin B gene are shown in Figure 19 and summarized in fable 23. 
These clones were utilized to isolate pure toxin B recombinant protein from the entire toxin B 
gene. Significant protein yields were obtained from pMAL expression constructs spanning the 
entire toxin B gene, and yields of full length soluble fusion protein ranged from an estimated 
1 mg/liter culture (pMBI 100-1530) to greater than 20 mg/liter culture (pMBI 750-2360). 

Representative purifications of MBP and poly-histidinc-tagged toxin B recombinants 
are shown in figures 21 and 22. f igure 21 shows a Coomassie Blue stained 7.5% SDS- 
PAGL gel on which various protein samples extracted from bacteria harboring pMBI850- 
2360 were electrophoresed. Samples were loaded as follows: fane I: protein extracted from 
uninduced culture: Lane 2: induced culture protein: Lane 3: total protein from induced culture 
after sonication: Lane 4: soluble protein: and Lane 5: eluted affinity purified protein, figure 
22 depicts the purification of recombinant proteins expressed in bacteria harboring cither 
PPBI850-2360 (Lanes 1-3) or pPBI750-2360 (Lanes 4-6). Samples were loaded as follows: 
uninduced total protein (Lanes 1 and 4): induced total protein (Lanes 2 and 5): and eluted 
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affinity purified protein (Lanes 3 and 6). The broad range molecular weight protein markers 
(BioRad) are shown in Lane 7. 

Thus, although high level expression was not attained using large! expression constructs 
from the toxin B gene, usable levels of recombinant protein were obtained by isolating 
induced protein from a series of smaller pMAL expression constructs that span the entire 
toxin B gene. 

These results represent the first demonstration of the feasibility of expressing 
recombinant toxin B protein to high levels in E. col,. As well, expression ol small regions of 
•he putative ligand binding domain (repeat region) ol toxin B as native protein yielded 
insoluble protein, while large constructs, or fusions to MBP were soluble (Figure 19). 
demonstrating that specific methodologies are necessary to produce soluble fusion protein 
from this interval. 

Hi) Clone Construction And Expression Details 
A portion of the toxin B gene containing the toxin B repeat region | amino acid 
residues 1852-2362 of toxin B (SF.Q ID NO:20)| was isolated by l»CR amplification of this 
interval of the toxin B gene from C. difficile genomic DNA. The sequence, and location 
within the toxin B gene, of the two PC R primers |P7 (SF.Q ID NO: 13) and P8 (SKO ID 
NO:l4)| used to amplify this region are shown in Figure 18. 

DNA from the PC R amplification was purified by chloroform extraction and cthanol 
precipitation as described above. The DNA was restricted with .S), c |. and the cleaved DNA 
was resolved by agarose gel electrophoresis. The restriction digestion with ty t | cleaved the 
3.6 kb amplification product into a 1.8 kb doublet band. This doublet band was cut from the 
gel and mixed with appropriately cut. gel purified pMALc or pF.T23b vector. These vectors 
were prepared by digestion with /Will, filling in the overhanging ends using the Klenow 
enzyme, and cleaving with A7>«1 (pMALc) or Nhe\ (pET23b). The gel purified DNA 
fragments were purified using the Prep-A-Gene kit (BioRad) and the DNA was ligatcd. 
transformed and putative recombinant clones analyzed by restriction mapping. 

pi: I and ! pMal clones containing the toxin B repeat insert (aa interval I 750-2360 -of 
toxin B) were verified by restriction mapping, using enzymes thai cleaved specilic sites within 
the toxin B region. In both cases fusion of the toxin B Spci site with either the compatible 
.Yhal site (pMal) or compatible \/;c I site (pET) is predicted to create an in frame fusion. This 
was confirmed in the case of the pMB 1 750-2360 clone by DNA sequencing of the clone 
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junction and -5' end of the toxin B insert using .a MB P specific primer (New England 
Biolabs) In the case of the pET construct, the fusion of the blunt ended toxin B 3' end to 
the filled HintMl site should create an in-frame fusion with the C-terminal poly-histidine 
sequence in this vector. The pPB 1750-2360 clone selected had lost, as predicted, the HinJlU 
site at this clone junction; this eliminated the possibility that an additional adenosine residue 
was added to the 3" end of the PCR product by a terminal transferase activity of the Pfu 
polymerase, since fusion of this adenosine residue to the filled H'mdlW site would regenerate 
the restriction site (and was observed in several clones). 

One liter cultures of each expression construct were grown, and fusion protein purified 
by affinity chromatography on either an amylose resin column (pMAL constructs: resin 
supplied by New England Biolabs) or Ni-chelatc column (pET constructs; resin supplied by 
Qiagen or Novagen* as described [ Williams ct ai (1995). supra\. The integrity and purity of 
the fusion proteins were determined by Coomassie staining of SDS-PAGE protein gels as well 
as Western blot analysis with either an affinity purified goat polyclonal antiserum (Tech Lab). 
\* or a chicken polyclonal .PEG prep, raised against the toxin B protein (CTB) as described 

above in -Example 8. In both cases, affinity purification resulted in yields in excess of 20 nm 
protein per liter culture, of which greater than 90% was estimated to be full-length 
recombinant protein. It should be noted that the poly-histidine affinity tagged protein was 
released from the Qiagen Ni-NTA resin at low imidazole concentration (60 mM). 
10 necessitating the use of a 40 mM imidazole rather than a 60 mM imidazole wash step during 
purification. 

A periplasmically secreted version of pMB 1750-2360 was constructed by replacement 
ol the promoter and MBP coding region of this construct with that from a related vector 
(pMAL ,M -p2; New England Biolabs) in which a signal sequence is present at the N-ierminus 
of the MBP. such that fusion protein is exported. This was accomplished by substituting a 
Z?i:/M-Z:V<,RV promoter fragment from pMAL-p2 into pMBI 750-2360. The yields of secreted, 
affinity purified protein (recovered from osmotic shock extracts as described bv Riggs in 
C urrent Protocols in Molecular lliotoxy. Vol. 2. Ausubel. a c//.. Lids. (1989). C urrent 
Protocols, pp. 16.6.1 - 16.6.14) from this vector (pMBpl 750-2360) were 6.5 mg/litcr culture. 
30 of which 50% was estimated to be full-length fusion protein. 

I he interval was also expressed in two non-overlapping fragments. pMBI 750-1970 
was constructed by introduction of a frameshift into pMB 1 750-2360. by restriction with 
///Will, filling in the overhanging ends and reiigation of the plasmid. Recombinant clones 
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were selected by loss of the /Ym</III site, .and further restriction map analysis. Recombinant 
protein expression from this vector was more than 20 mg/liter of greater than 90% pure 
protein. 

The complementary region was expressed in pMB 1 970-2360. This construct was 
created by removal of the 1750-1970 interval of pMB 1 750-2360. This was accomplished bv 
restriction of this plasmid with EcoRI (in the pMalc polyliriker 5 to the insert, and III. filling 
in the .overhanging ends, and religation of the plasmid. The resultant plasmid. pMB 1970-2360, 
was made using both inlracellularly and secreted versions of the pMBI 750-2360 vector. 

No fusion protein was secreted in the pM Bp 1 970-2360 version, perhaps due to a 
conformational constraint that prevents export of the fusion protein. However, the 
intracellular^ expressed vector produced greater than 40mg/liter of greater than 90% full- 
length fusion protein. 

C onstructs to precisely express the toxin B repeats in either pMalc (pMBI 850-2360) or 
pi:TI6b (pPB 1 850-2360) were constructed as follows. The DNA interval including the toxin 
B repeats was I'CR amplified as described above utilizing PCR primers PI4 (SHQ 11) NO:l9) 
and P8 (SHQ ID NO: 14). Primer P 14 adds a Ac„RI site immediately flanking the start of the 
toxin B repeats. 

The amplified fragment was cloned into the pT7 Blue I - vector (Novagen) and 
recombinant clones in which single copies of the PCR fragment were inserted in cither 
orientation were selected < P T71 850-2360) and confirmed by restriction mapping. The insert 
was excised from two appropriately oriented independently isolated pT 71 850-2360 plasmids. 
with Ev»lU (5- end of repeats) and I'.ul (in the Hanking polylinkei of the vector), and cloned 
into /.YoRI//\v/| cleaved pMalc vector. The resulting construct ( pMB 1 850-2360, was 
confirmed by restriction analysis, and yielded 20 mg/l of soluble fusion protein [greater than 
90% intact {i.e.. nondegraded)j after affinity chromatography. Restriction of this plasmid with 
///Will and religation of the vector resulted in the removal of the 1970-2360 interval. The 
resultant construct ( P MB1850-I970) expressed greater than 70 mg/liter of 90% full length 
fusion protein. 
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The pPB 1 850-2360 construct was made by cloning a EcoRI (filled with Klenow)- 
BamHl fragment from a pT7 1 850-2360 vector (opposite orientation to that used in the 
, pMBl 850-2360 construction) into Mel (filled)/tfamHI cleaved pETI6b vector. Yields of 
affinity purified soluble fusion protein were 1 5 mgyiiter. of greater than 90% full length 
fusion protein. 

Several smaller expression constructs from the 1 750-2070 interval were also 
constructed in His-tagged pET vectors, but expression of these plasmids in the Bl 21 (DE3) 
host resulted in the production of high levels of mostly insoluble protein (see Table 23 and 
Figure 19): These constructs were made as follows. 

P PBI850-I970 was constructed by cloning a BgM-Himnil fragment of P PB! 850-2360 
into cleaved P ET23b vector. pPBI850-2070 was constructed by cloning a 

. /WII-/V„I| fragment of pPB 1 850-2360 into BglMIHincW cleaved P ET23b vector. pPB1750- 
1970(e) was constructed by removal of the internal HinctlU fragment of a pPB 1750-2360 
vector in which the vector Hindi II site was regenerated during cloning (presumably by the 
addition of an A residue to the amplified PCR product by terminal transferase activitv of />/„ 
polymerase). The P PBI750-I970(n) construct was made by insertion of the insert containing 
the A</ t >l-//,WI|| fragment of pPB 1 750-2360 into identically cleaved pETHisb vector. All 
constructs were confirmed by restriction digestion. 

An expression construct that directs expression of the 10-470 aa interval of toxin B 
was constructed in. the pMalc vector as follows. A NhA (a site 5' to the insert in the pi:T23 
vector )-A/l\\ (111 led ) fragment o f the toxin B gene from pPB 1 0- 1 530 was cloned into Xhal 
(compatible with MwWHbuMX (filled) pMalc vector. The integrity of the construct (pMBIO- 
470) was verified by restriction mapping and DNA sequencing of the 5" clone junction using 
a MBP specific DNA primer (New England Biolabs). However, all expressed protein was 
degraded to the MBP monomer MW. 

A second construct spanning this interval (aa 10-470) was constructed by cloning the 
PCR amplification product from a reaction containing the P9 (SEQ ID NO: 1 5) and P10 (SEQ 
10 NC):16) primers (Figure 18) into the pETHisa vector. This was accomplished by cloning 
the PCR product as an ftc«RI-blunt fragment into EcnM-HincW restricted vector DNA: 
recombinant clones were verified by restriction mapping. Although this construct (pPBIO- 
520) allowed expression and purification (utilizing the N-terminal polyhistidine affinity tag) of 
intact fusion protein, yields were estimated at less than 500 ug per liter culture. 
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Higher yield of recombinant protein from this interval (aa 1 0-520) were obtained by 
expression of the interval in two overlapping clones. The IO-330aa interval was cloned in 
both pHTHisa and pMalc vectors, using the BamH\-Afl\\\ (filled) DN A i fragment from pPBlO 
520. This fragment was cloned into Bam\\\-Hind\l\ (filled) restricted pMalc or BamH\-liinc\\ 
restricted pETHisa vector. Recombinant clones were verified by restriction mapping. High 
level expression of either insoluble (pLT) or soluble (pMal) fusion protein was obtained. Total 
yields of fusion protein from the pMBl 0-3.10 construct (Figure 18) were 20 mg/Uter culture, 
of which 10% was estimated to be full-length fusion protein. Although- yields of this interval 
were higher and >«0% full-length recombinant protein produced when expressed from the 
pET construct, the pMal fusion was utilized since the expressed protein was soluble and thus 
more likely to have the native conformation. 

The pMB260-520 clone was constructed by cloning licolU-Xhcd cleaved amplified 
DNA from a PCR reaction containing the PI I (SFQ ID NO. 1 7) and PI0 (SFQ.ID NO: 16) 
DNA primers (Figure 18) into similarly restricted pMalc vector. Yields of affinity purified 
protein were 10 mg/liter. of which approximately 50% was estimated to be lull-length 
recombinant protein. 

The aa5I0-l 1 10 interval was expressed as described below . This entire interval was 
expressed as a pMal fusion by cloning the Nhc\-llind\\\ fragment of pUCBlO-1530 into Xhal- 
IlimAU cleaved pMalc vector. The integrity of the construct (pMB510-l 1 10) was verified by 
restriction mapping and DNA sequencing of the 5" clone junction using a MBP specific DNA 
primer. The yield of affinity, purified protein was 25 mg/liter culture, ol which 5% was 
estimated to be lull-length fusion protein (I mg/liter I. 

lu attempt to obtain higher yields, this region was expressed in two fragments (aa5l0- 
820. and 820-1 1 10) in the pMalc vector. The pMB5 1 0-820 clone was constructed by 
insertion ol a Sad (in the pMalc polylinkcr 5* to the insert )-llpu\ DNA fragment from 
pMBSlO-l 1 10 into Sav\iStu\ restricted pMalc vector. I he pMB820-l I 10 vector was 
constructed by insertion of the Hpul -Wnd\ II fragment of pUCBIO-l 530 into BumHl (filled)/ 
Hint/Ill cleaved pMalc vector. The integrity of these constructs were verified by restriction 
mapping and DNA sequencing of the 5" clone junction using a MBP specific DNA primer. 

Recombinant protein expressed from the pMB5IO-820 vector was highly unstable. 
However, high levels (20 mg/liter) of >«()% full-length fusion protein were obtained from the 
pMB820-l 105 construct. The combination of partially degraded pMB510-l 1 10 protein 
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(enriched for the 510-820 interval) with the pMB820-l 1 10 protein provides usable amounts of 
recombinant antigen from this interval. ' 

The aal 100-1 750 interval was expressed as described below. 1 The entire interval was 
expressed in the pMalc vector from a construct in which the /ftd(fillcd)-.S/*.<I fragment of 
pPB 1 0- 1 750 was inserted into S,u\/Xba\ (Xbal is compatible with S,,cl : Stul and filled Acc\ 
sites are both blunt ended) restricted pMalc. The integrity of this construct (pMB 1 100-1750) 
was verified by restriction mapping and DNA sequencing of the clone junction using a MBP 
specific DNA primer. Although 15 mg/liter of affinity purified protein was isolated from cells 
harboring this construct, the protein was greater than 99% degraded to MBP monomer size. 

A smaller derivative of pMBI 100-1750 was constructed by. restriction ofpMBl 100- 
1750 with Aim and Sail (in the pMalc polvlinker 3" to the insert), filling in the overhanging 
ends, and relating the plasmid. The resultant clone (verified by restriction digestion and 
.UNA. sequencing* has deleted (he aal530-l750 interval, was designated pMBI 100-1530 
pMUl 100-1530 expressed recombinant protein at a yield of greater than 40 mg/liter. of which 
5% was estimated to be full-length fusion protein. 

Three constructs were made to express the remaining interval. Initially. a.Bsptil 
|filled)-A/,cl fragment from pPB10-1750 was cloned into £™Rl(filled)/.\7*,l cleaved pMalc 
vector. The integrity of this construct (pMBI 570- 1 750) was verified by restriction mapping 
and DNA sequencing of the 5" clone junction using a MBP specific DNA primer! Expression 
of recombinant protein from this plasmid was very low. approximately 3 mg affinity purified 
protein per liter, and most was degraded to MBP monomer size. This region was 
subsequently expressed from a PCR amplified DNA fragment. A PCR reaction utilizing 
primers P13 |SEQ ID NC):I8: P13 was engineered to introduce an licoRI site 5" to amplified 
toxin B sequences! and P8 (SMQ ID N():I4) was reformed on C. difficile genomic DNA as 
described above. The amplified fragment was cleaved with EcoKL and Spel. and cloned into 
EcnlWXbal cleaved pMalc vector. The resultant clone (pMB 1 530- 1 750) was verified by 
restriction map analysis, and recombinant protein was expressed and purified. The yield was 
greater than 20 mg protein per liter culture and it was estimated that 25% was full-length 
fusion protein: this was a significantly higher yield than the original pMB 1 570- 1 750 clone. 
The insert of pMB 1 530- 1 750 (in a EcoRUSalX fragment) was transferred to the pliTHisa 
vector (EcoRWXhol cleaved. Xho\ and Sail ends are compatible). No detectable fusion 
protein was purified on Ni-Chelate columns from soluble lysates of cells induced to express 
fusion protein from this construct. 
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TABLE 23 

Summary Of Toxin B Expression Constructs 4 





Clone 


rtiiinuy i ag 


Yield (nig/liter) 


% 1 till Length 




pPBI'<M750 


none 


low (estimated from Western 
blot hyb.) 


*» 


5 


pPBIO-1530 


none 


low" (as above) 


*» 




pi»i n i fx) 


MBP 


15mg/t 


0% 






poly-his 


0.5m«'l 


20% 




pi fJ I U-jjU 


poly-his 


-20mg/l (insoluble) 


90% 




#1 1 /W / /I } 7/1 

Pa US itJ-JJU . 


MBP 


iVJiff.e// 


Ui% 


10 


/''W/J. (Hf-jjlt 


MBP 


/Owe// 


$o<y n 




« A //? •» / /J / / / /i 

frf\ff>J l U- / I I U 


MBP 


25mg/t 






pMB5 10-820 


MBP 


degraded (by Western blot 
hyb » 








MBP 


2(hn&l 


writ, 




pMBl 100-1750 


MBP 


15mg/l 


()% 


15 


p.UBI tOO-l 53(1 


MBP 




5 'tit 




pMB 1570- 1750 


MBP 


3 nigs 1- 


5% 




pPB 1530-1 750 


poly-his 


no purified protein detected 


*» 




p\mtSJO-i'50 


MBP 




25% 




p\mr 50-2 36u 


MBP 


-JOm&i 




20 


pM Bp 1750-2360 


MBP 


6.5mg/l (secreted) 


50% 




pPB 1750-2360 


poly-his 


20mg/l 


90% 




pMB 1750-1970 


MBP 


-20mg/l 


90% 




, pMB 1970-2360 


MBP 


40mg/| 


90% 




pMBp 1970-2360 


MBP 


(no secretion* 


NA 


25 


pMB 1X50-2360 


MBP 


20mg/l 


90% 




pPB 1850-2360 


poly-his 


I5mg'l 


•90% 




pMB 1850- 1970 


MBP 


70mg/| 


90% 




pPB 1850- 1970 


poly-his 


"lOmg'l (insoluble) 


90% 




pPB 1850-2070 


poly-his 


IOmg/1 (insoluble) 


90% 


30 


pPB 1750- 1970(c) 


poly-his 


I0mg/| (insoluble) 


90% 




pPB 1750- I970(n) 


poly-his 


lOmg/i (insoluble) 


90% 



Clones til italics arc clones currently utilized to purify recombinant protein from each selected 
interval. 

35 
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EXAMPLE 19 

Identification. Purification And Induction Of Neutralizing 
Antibodies Against Recombinant C. difficile Toxin B Protein 

To determine whether recombinant toxin B polypeptide fragments can generate 
neutralizing antibodies, typically animals would first be immunized with recombinant proteins 
and anti-recombinant antibodies arc generated. These anti-recombinant protein antibodies are 
then tested for neutralizing ability in vivo or in vitro. Depending on the immunogenic nature 
of the recombinant polypeptide, the generation of high-titer antibodies against that protein 
may take several months. To accelerate this process and identify which recombinant 
polypeptide* s) may be the best candidate to generate neutralizing antibodies, depletion studies 
were performed. Specifically, recombinant toxin B polypeptide were pre-screened bv testinu 
whether .they have the ability to bind to protective antibodies from a CTB antibody 
preparation and hence deplete those antibodies of their neutralizing capacity. Those 
recombinant polypeptides found to bind CTB. were then utilized to generate neutralizing 
antibodies. This Example involved; a) identification of recombinant sub-regions within toxin 
B to which neutralizing antibodies bind: b) identification of toxin B sub-region specific 
antibodies that neutralize toxin B in vivo: and c) generation and evaluation of antibodies 
reactive to recombinant toxin B polypeptides. 

a) Identification Of Recombinant Sub-Regions Within Toxin B 
To Which Neutralizing Antibodies Bind 

Sub-regions within toxin B to which neutralizing antibodies hind were identified by 
utilizing recombinant toxin B proteins to deplete protective antibodies from a polyclonal pool 
of antibodies against native ( 4 . difficile toxin B. An in vivo assay was developed to evaluate 
protein preparations for the ability to bind neutralizing antibodies. Recombinant proteins were 
first pre-mixed with antibodies directed against native toxin B (CTB antibody; see Example 8) 
and allowed to react for one hour at 37°C\ Subsequently. C. cliff idle toxin B (CTB; Tech 
Lab) was added at a concentration lethal to hamsters and incubated for another hour at 37°C. 
After incubation this mixture was injected intraperitoneal^ (IP) into hamsters. If the 
recombinant polypeptide contains neutralizing epitopes, the CTB antibodies will lose its 
ability to protect the hamsters against death from CTB. If partial or complete protection 
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occurs with the CTB antibody-recombinant mixture, that recombinant contains onlv weak or 
non-neutralizing epitopes of toxin B. This assay was performed as follows!' 

Antibodies against CTB were generated in egg laving Leghorn hens as described in 
Kxamplc 8. The lethal dosage (LD „„,) of C difficile toxin B when delivered I.P. into 40u 
female Golden Syrian hamsters (Charles River) was determined to be 2.5 to 5 ug. Antibodies 
generated against CTB and purified by PEG precipitation could completclv protect the 
hamsters at the I.P. dosage determined above. The minimal amount of CTB antibodv needed 
to allord good protection against 5 ug of CTB when injected 1. 1* into hamsters was also 
determined ( I X PEG prep) : These experiments defined the parameters needed to test whether 
a given recombinant protein could deplete protective CTB antibodies. 

The cloned regions tested for neutralizing ability cover the entire toxin B gene and 
were designated as Intervals (INT) I through 5 (see Figure .0). Approximately equivalent 
Unal concentrations of each recombinant polypeptide were tested. The following recombinant 
polypeptides were used: I ) a mixture of intervals I and 2 (INT-I. 2): 2) a mixture of 
Intervals 4 and 5 (1NT-4. 5, and 3) Interval 3 (INTO). Recombinant proteins (each at about 
I mg total protein, were firs, preincubated with a final CTB antibody concentration of IX 
l/.c. pellet dissolved in original yolk volume as described in Example l(c)| in a final volume 
ol 5 mis lor I hour at 37»C. Twenty-five ug of CTB (at a concentration of 5 M g/ml). enough 
CTB to kill 5 hamsters, was then added and the mixture was then incubated for I hour at 
37»C. Five. 40g female hamsters (Charles River) in each treatment group were then each 
given , ni | ? f lhc mixture I.P. using a tuberculin syringe with a 27 gauge needle. The results 
of this experiment are shown in Table 24. 



25 



30 



Binding 


TABLE 24 

Of Neutralizing Antibodies Bv INT 3 Protein 


Treatment Group* 


Number Of Animals Alive 


Number Of Animals Dead 


CTB antibodies 


3 




CTB antibodies * INTL2 


3 


-> 


CTB antibodies * INT4.5 


3 


-> 


CTB antibodies + INT 3 


0 


5 



( : ili/Jkile toxin ii (CTB) was added to each group. 



As shown in Table 24. the addition of recombinant proteins from INT-1. 2 or JNT-4 S 
had no effect on the in vfvo protective ability of the CTB antibody preparation compared to 
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the CTB antibody preparation alone. In contrast. INT-3 recombinant polypeptide was able to 
remove all of the toxin B neutralizing ability of the CTB antibodies as demonstrated by the 
death of all the hamsters in that group. 

The above experiment was repeated using two smaller expressed fragments (pMB 
1750-1970 and pMB 1970-2360. see Figure 19) comprising the INT-3 domain to determine if 
that domain could be further subdivided into smaller neutralizing epitopes. In addition, ful!- 
. length INT-3 polypeptide expressed as a nickel tagged protein (pPB 1 750-2360) was tested for 
neutralizing ability and compared to the original INT-3 expressed MBP fusion (pMBI750- 
2360). The results are shown in Table 25. 

TABLE 25 



Treatment Group 1 


Number Of Animals Alive 


Number Of* Animals Dead 


CTB antibodies 


5 


0 


C TB antibodies *• pPB 1750-2360 


0 


5 


CTB antibodies t pMB I 750-2360 


0 


5 


CTB antibodies * pMB 1970-2360 


3 




CTB antibodies - pMB 1750- 1970 


2 


3 



( tfiffhttc loxin B (CTB) was added to each iiroup. 



T he results summarized in Table 25 indicate that the smaller polypeptide fragments 
within the INT-3 domain. pMB! 750-1970 and pMB 1970-2360. partially lose the ability to 
bind to and remove neutralizing antibodies from the CTB antibody pool. These results 
demonstrate thai the full length INT-3 polypeptide is required to completely deplete the CTB 
antibody pool of neutralizing antibodies. This experiment also shows that the neutralization 
epitope of INT-3 ean be expressed in alternative vector systems and the results are 
independent of the vector utilized or the accompanying fusion partner. 

Other Interval .3 specific proteins were subsequently tested for the ability to remove 
neutralizing antibodies within the CTB antibody pool as described above. The Interval 3 
specific proteins used in these studies are summarized in Figure 23. In Figure 23 the 
following abbreviations arc used: pP refers to the pET23 vector: pM refers to the pM A Le- 
vee lor; B refers to toxin B; the numbers refer to the amino acid interval expressed in the 
clone. The solid black ovals represent the MBP; and I IHH represents the poly-histidine lag. 
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Only recombinant proteins comprising the entire toxin B repeat domain (pMB 1 750- 
2360. pPB 1 750-2360 and pPB1850-2360) can bind and completely remove neutralizing 
antibodies from the CTB antibody pool. Recombinant proteins comprising only a portion of 
the toxin B repeat domain were not capable of completely removing neutralizing antibodies 
from the CTB antibody pool (pMB 1750-1970 and pMB 1970-2360 could partially remove 
neutralizing antibodies while pMB 1 850-1 970 and pPB 1 850-2070 failed to remove any 
neutralizing antibodies from the CTB antibody pool). 

The above results demonstrate that only the. complete, ligand binding domain (repeat 
region, of the toxin B gene can bind and completely remove neutralizing antibodies from the 
CTB antibody pool. These results demonstrate that antibodies directed against the entire toxin 
» repeat region are necessary for in vivo toxin neutralization (see Figure 23: only the 
recombinant proteins expressed by the pMBl 750-2360. pI'Bl 750-2360 and pPB 1 850-2360 
vectors are capable of completely removing the neutralizing antibodies from the CTB' 
antibody pooh. 

These results represent the first indication that the entire repeat region of toxin B 
would he necessary for the generation of antibodies capable of neutralizing toxin B. and that 
sub-regions may „oi be sufficient to generate maximal titers of neutralizing antibodies. 

I>) Identification Of Toxin B Sub-Region Specific Antibodies 
That Neutralize Toxin B In Vivo 

To determine if antibodies directed against the toxin B repeat region are sufficient for 
neutralization, region specific antibodies within the CTB antibody preparation were affinity 
purified, and tested for in vivo neutralization. Affinity columns containing recombinant toxin 
B repeal proteins were made as described below. A separate affinity column was prepared 
using each of the following recombinant toxin B repeat proteins: pPB 1750-2360. pPB1850- 
2360. pMB 1750- 1 970 and pMB 1 970-2360. 

I or each affinity column to be made, lour ml of PBS-washcd Actigel resin (Slerogene) 
was coupled overnight at room temperature with 5-10 mg of affinity purified recombinant 
protein (first extensively dialyzed into PBS) in 15 ml tubes ( 1'alcon) containing a 1/10 final 
volume Aid-coupling solution ( 1 M sodium cyunoborohydride). Aliquots of the supcrnatants 
from the coupling reactions, before and after coupling, were assessed by Coomassic staining 
of 7.5% SDS-PAGE gels. Based on protein band intensities, in all cases greater than 30% 
coupling efficiencies were estimated. The resins were poured into 10 ml columns (BioRad). 
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washed extensively with PBS. pre-eluted with 4M guanidine-HCI (in 10 mM Tris-HCI. pH 
8.0) and reequilibrated in PBS. The columns were stored at 4°C. 

Aliquots of a CTB IgY polyclonal antibody preparation (PEG 5 prep) were affinity 
purified on each of the lour columns as described below, The columns were hooked to a UV 
monitor (ISCO). washed with PBS and 40 ml aliquots of a 2X PEG prep (filter sterilized 
using a 0.45 u filter) were applied. The columns were washed with PBS until the baseline 
value was re-established. The columns were then washed with BBStween to clute 
nonspecific-ally binding antibodies, and reequilibrated with PBS. Bound antibody was eiuied 
from the column in 4M guanidine-HCI (in lOmM Tris-HCI. P H8.0). The cluted antibody was 
immediately dialyzed against a 100-fold excess of PBS at 4°C for 2 hrs. The samples were 
then dialyzed extensively against at least 2 changes of PBS. and affinity purified antibody was 
collected and stored at 4%\ The antibody preparations were quantified by UV absorbancc. 
The clution volumes were in the range of 4-8 ml. All affinity purified stocks contained 
similar total antibody concentrations, ranging from 0.25-0.35% of the total protein applied to 
the columns. 

The ability of the affinity purified antibody preparations to neutralize toxin B in vivo 
was determined using the assay outlined in a) above. Affinity purified antibody was diluted 
1:1 in PBS before testing. The results arc shown in fable 26. 

In all cases similar levels of toxin neutralization was observed, such that lethality was 
delayed in all groups relative to prcimmune controls. This result demonstrates that antibodies 
reactive to the repeal region of the toxin B gene are sufficient to neutralize toxin B in vivo. 
the hamsters will eventually die in all groups, but this death is maximally delayed with the 
CTB PEX} prep antibodies. Thus neutralization with the affinity purified (AP) antibodies is 
not as complete as that observed with the CTB prep before affinity chromatography. This 
result may be due to loss of activity during guanidine denaturation (during the elation of the 
antibodies from the affinity column) or the presence of antibodies specific to other regions of 
the toxin B gene that can contribute to toxin neutralization (present in the CTB PF.G prep). 
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TABLE 26 

Neutralization Of Toxin B By' Affinity Purified Antibodies 



Treatment group' 



Preimmunc 1 



CTB': 400 Mg 



Number Animals Alive" 



Number Animals Ocad k 



CTB (AP on pPB 1750-2360);- 875 u« 



C TB (AP on pMBI750-l970): ; 875 ug 



CTB (AP on pMB 1 970-2360);- 500 ug 



< difficile toxin H (CTB) (Tech Lab: at 5 25 Mg total) at lethal concentration to 

hamsters- is added to antibody and incubated for one hour at 37"C. Alter incubation this 
mixture is injected intraperitoncallv (IP) into hamsters .Each treatment aroup received loxin 
prermxed with antibody raised against the indicated protein, as either <4X amibodv PEG nrcn 
or oninity purified (AP) antibody (from CTB PEG prep. „„ indicated columns). The amounf 
ol specific antibody n, each prep is indicated: the amount is directlv determined for affinitv 
purified preps and .s estimated for the 4X CTB as described in Example 15 ' 

1 he numbers in each group represent numbers of hamsters dead or alive. 2 hr post IP 
administration of toxin/antibody mixture. 



The observation that antibodies affinity purified against the non-overlapping 
pMBi 750-1970 and- pMB 1970-2360 proteins neutralized toxin B raised the possibility that 
either I ) antibodies specific to repeat sub-regions are sufficient to neutralize toxin U or 2) 
sub-region specific proteins can bind most or all repeat specific antibodies present in the CTB 
polyclonal pool. This would likely be due to conformational similarities between repeats 
since homology, in the primary amino acid sequences between different repeats is in the range 
of only 25-75% fEichel-Strciber. ei at. ( 1992) Molec. Gen. Genetics 233:260|. These 
possibilities were tested by affinity chromatography. 

The CTB PliG prep was sequentially depleted 2X on the pMB 1750- 1970 column: only 
a small elution peak was observed after the second chromatography, indicating that most 
reactive antibodies were removed. This interval depicted CTB preparation was then 
chromatographed on the pPB 1850-2360 column; no antibody bound to the column. The 
reactivity of the CTB and CTB (pMB 1 750-1970 depleted) preps to pPBl 750-2360. pPB1850- 
2360. pMB 1750- 1 970 and pMB 1 970-2360 proteins was then determined by I- USA using the 
protocol described in Hxumple 13{c). Briefly. 96-well microliter plates ( f alcon. Pro-Bind 
Assay Plates) were coated with recombinant protein by adding 100 u | volumes of protein at 
1-2 ug/ml in PBS containing 0.005% thimcrosal to each well and incubating overnight at 4°C. 
The next morning, the coating suspensions were decanted and the wells were washed three 
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times using PBS. In order to block non-specific binding sites. 100 ul of 1.0% BSA (Sigma) 
in PBS (blocking solution) was then added to each well, and the plates were incubated for ! 
, hr. at 37°C. The blocking solution was decanted and duplicate samples of ISO ul of diluted 
antibody was added to the first well of a dilution series. The initial testing scrum dilution 
was (1/200 for CTB prep, (the concentration of depleted CTB was standardized by OI),,,,) in 
blocking solution containing 0.5% Tween 20. followed by 5-fold serial dilutions into tliLs 
solution. This was accomplished by serially transferring 30 ul aliquots to 120 ul buffer, 
mixing, and repeating the dilution into a fresh well. After the final dilution. 30 ul was 
removed from the well such that all wells contained 120 ul final volume. A total of 5 such 
dilutions were performed (4 wells total). The plates were incubated for I hr at 37°C. 
F ollowing this incubation, the serially diluted samples were decanted and the wells were 
washed three times using PBS containing 0.5% Tween 20 (PBST). followed by two 5 min 
washes using BBS-Tween and a final three washes using PBST. To each well. 100 ul of" 
I 1000 diluted secondary antibody Irabbit anti-chicken IgCi alkaline phosphatase (Sigmai 
diluted in blocking solution containing 0.5% Tween 20| was added, and the plate was 
incubated I hr at 37°C. The conjugate solutions were decanted and the plates were washed 6 
times in PBST. then once in 50 mM Na,CO,. 10 mM MgCT. pH 9.5. The plates were 
developed by the addition of 100 ul of a solution containing I mg/ml para-nitro phenyl 
phosphate (Sigma) dissolved in 50 mM Na.CO,. 10 mM MgCl,. pH9.5 to each well. The 
plates were then incubated at room temperature in the dark for 5-45 min. The absorbency of 
each well was measured at 410 nm using a Dynatech MR 700 plate reader. 

As predicted by the affinity chromatography results, depletion of the CTB prep on the 
PMBI750-I970 column removed all detectable reactivity to the pMB 1970-2360 protein. The 
reciprocal purification of a CTB prep that was depicted on the pMB 1 970-2360 column 
yielded no hound antibody when chromatography on the pMBI750-l970 column. These 
results demonstrate that all repeat reactive antibodies in the CTB polyclonal pool recognize a 
conserved structure that is present in non-overlapping repeats. Although it is possible that this 
conserved structure represents rare conserved linear epitopes, it appears more likely that the 
neutralizing antibodies recognize a specific protein conformation. This conclusion was also 
suggested by the results of Western blot hybridization analysis of CTB reactivity to these 
recombinant proteins. 

Western blots of 7.5% SDS-PAGL gels, loaded and electrophoresed with defined 
quantities of each recombinant protein, were probed with the CTB polyclonal antibody 
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preparation. The blots were prepared and developed with alkaline phosphatase as described in 
Example 3. The results are shown in Figure 24. 

Figure 24 depicts a comparison of immunoreactivity of IgY antibody raised against 
either native or recombinant toxin B antigen. F.qual amounts -of pMB1750-l<>70 (lane I ). 
pMB 1 970-2360 (lane 2). pPBI850-236O (lane 3) as well as a serial dilution of pPB 1750-2360 
(lanes 4-6 comprising IX. 1/1 OX and 1/100X amounts, respectively) proteins were loaded in 
duplicate and resolved on a 7.5% SDS-PAGE gel. The gel was blotted and each half was 
hybridized with PEG prep IgY antibodies from chickens immunized with either native CTB or 
PPBI750-2360 protein. Note that the full-length pMB 1 750- 1 970 protein was identified only 
by antibodies reactive to the recombinant protein (arrows). 

Although the CTB prep reacts with the pPB 1 750-2360. pi'B 1 850-2360. and pMB1970- 
2360 proteins, no reactivity to the pMB 1 750-1 970 protein was observed (Figure 24). Given 
that all repeat reactive antibodies can be bound by this protein during affinity 
chromatography, this result indicates that the protein cannot fold properly on Western blots. 
Since this eliminates all antibody reactivity, it is unlikely that the repeat reactive antibodies in 
the C TB prep recognize linear epitopes. This may indicate that in order to induce protective 
antibodies, recombinant toxin B protein will need to be properly folded. 

e) Generation And Evaluation Of Antibodies Reactive To 
Recombinant Toxin li Polypeptides 
i) Generation Of Antibodies Reactive To 
Recombinant Toxin B Proteins 

Antibodies against recombinant proteins were generated in egg laying Leghorn hens as 
described in Example 13. Antibodies were raised | using Frcunds adjuvant (Gibco) unless 
otherwise indicated! against the following recombinant proteins: I ) a mixture of Interval I +2 
proteins (see Figure 18): 2) a mixture of interval 4 and 5 proteins (see Figure 18): 3) 
pMBl "70-2360 protein: 4) pPB 1 750-2360 protein: 5) pMB 1750-2360: 6) pMB 1 750-2360 
llitermax adjuvant (Vaxcell)): 7) pMB 1 750-2360 (Gerbu adjuvant (Bioiech)|: 8) P MB P 1750- 
2360 protein: <)) p|'B 1 850-2360: and 10) pMB 1850-2360. 

Chickens were boosted at least 3 times with recombinant protein until F.I. ISA 
reactivity (using the protocol described in b) above with the exception that the plates were 
coated with pPB 1750-2360 protein] of polyclonal PF.G preps was at least equal to thai of the 
CTB polyclonal antibody PEG prep. EI.ISA titers were determined for the PEG preps from 
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all of the above immunogens and were found to be comparable ranging from 1:12500 to 
i :62500. High titers were achieved in all cases except in 6) pMBl 750-2360 in which strong 
titers were not observed using the Titerrnax adjuvant, and this preparation was not tested 
further. ' 

ii) Evaluation Of Antibodies Reactive To 
Recombinant Proteins By Western Blot 
Hybridization 

Western blots of 7.5% SDS-PAGC gels, loaded and electrophorcsed with defined 
quantities of recombinant protein (pMBl 750-1970. pPB 1 850-2360. and pMBI970-2360 
proteins and a serial dilution of the pPB 1750-2360 to allow quantification of reactivity), were 
probed with the CTB. pPB I 750-2360. pMB 1 750-2360 and pMB 1970-2360 polyclonal 
antibody preparations (from chickens immunized using Frcunds adjuvant). The blots were 
prepared and developed with alkaline phosphatase as described above in h). 

As -shown in Figure 24. the CTB and pMB 1970-2360 preps reacted strongly with the 
pPB I 750-2360.. pPB 1 850-2360. and pMB 1970-2360 proteins while the pPB 1 750-2360 and 
pMB 1 970-2360 (Gcrbu) preparations reacted strongly with all four proteins. The Western 
blot reactivity of the pPB 1 750-2360 and pMB 1 970-2360 (Gerbu) preparations were equivalent 
to that of the CTB preparation, while reactivity of the pMB 1 970-2360 preparation was <10% 
that of the CTB prep. Despite equivalent ELISA reactivities only weak reactivity 
(approximately 1%) to the recombinant proteins were observed in PEG preps from two 
independent groups immunized with the pMB ! 750-2360 protein and one group immunized 
with the pMBl 750-2360 preparation using Freunds adjuvant'. 

Affinity purification was utilized to determine if this difference in immunoreactivity by 
Western blot analysis rellccts differing antibody titers. Fifty ml 2X PEG preparations from 
chickens immunized with either pMB 1 750-2360 or pMBl 970-2360 protein were 
chromatographed on the pPB 1 750-2360 affinity column from b) above, as described. The 
yield of affinity purified antibody (% total protein in preparation) was equivalent to the yield 
obtained from a CTB PF.G preparation in b) above. Thus, differences in Western reactivity 
relied a qualitative difference in the antibody pools, rather than quantitative differences.. 
These results demonstrate that certain recombinant proteins are more effective at generating 
high affinity antibodies (as assayed by Western blot hybridization). 
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iii) In Vivo Neutralization Of Toxin B Using 

Antibodies Reactive To Recombinant Protein 

The in vivo hamster model (described in Examples 9 and I4(b)]l was utilized to assess 
the neutralizing ability of antibodies raised against recombinant toxin B proteins. The results 
Irom three experiments are shown below in Tables 27-29. 

The ability of each immunogen to neutralize toxin B in vivo has been compiled and is 
shown ,„ ,able 30. As predicted from the recombinant protein-CTB premix studies (Table 
24) only antibodies to Interval 3 ( 1 750-2366) and not the other regions of toxin B (,> 
imenals 1-5) are protective. Unexpectedly, antibodies generated to INT-3 region expressed in 
PMAL vector (pMBl 750-2360 and pMpB 1 750-2360) using Freunds adjuvant were non- 
neutralizing. This observation is reproducible, since no neutralization was observed in two 
."dependent immunizations with pMB 1 750-2360 and one tmmunization with P MpBI750- 
2360, The fact that 5X quantities of affinity purified toxin B repeat specific antibodies from 
pMBl7>0-236(> PHG preps cannot neutralize toxin B while IX quantities of affini.v pur ifi c d 
ant.-n B antibodies can (Table 28) demonstrates that the differential ability of CTB 
antibodies to neutralize toxin B is due to qualitative rather than quantitative differences in 
these antibody preparations. Only when this region was expressed in an alternative vector 
(pPB 1 750-3360) or using an alternative adjuvant with the pMB 1750-2360 protein were 
neutralizing antibodies generated Importantly, antibodies raised using f reunds adjuvant to 
pPB 1850-2360. which contains a fragment that is only 100 amino acids smaller than 
recombinant pPBI 750-2360. are unab.e to neutra.ize toxin B in vivo (Table 27): note also that 
the same vector is used for both pPBI85O-2360 and pPB 1750-2360. 
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Treatment Group" 



I'reimmune 



CTB 



INTI+2 



INT 4*5 



pMB 1 750-2360 



pMB 1970-2360 



pPB 1 750-2360 



TABLE 27 

in Viva Neutralization Of Toxin B 



Number Animals Alive h 



Number Animals Dead h 



d,f!,c,lc toxm B (CTB) (a, 5 uu/n.I: 25 ug to.al; Tech Lab) a. lethal concentration to 
hamsters , s added to antibody and incubated for one hour a, 37°C. Alter incuSn ,h s 
nmuure is m.ected mtraperi.oneally (IP) into hamsters. Each treatmem a roup received toxin 
premised w,,h am.body raised against the indicated protein, as a 4X anybody peg prep 

Hie numbers in each group represent numbers or hamsters dead or alive. 2 hours post ll» 
administration. ol toxin/antibody mixture. 



TABLE 28 

in i tvo Neutralization Of Toxin B Usini! Af ffnilv Purified Antibodies 


Treatment Groups 


Number Animals Alive* 1 


Number Animals Dead* 


Preimmunei 1 ) 


0 


5 


CTB{1) 


5 


0 


pPB 1750-2360(1) 


5 


0 


15 niLi anti-pMB 1 750-2360(2) 


1 


-J 


1-5 mt! anii-pMBI<>70-2360(2) 


0 


> 


300 ho anti-CTB(2) 


- 5 


0 



30 



.53 



T. <CT -k ( T * >,S ' m,: 25 ™ ,0,a,:TeC " ,J,b » » k "' al concentration ... 
hamsters ls added to antibody and incubated lor one hour at 37»C. After incubation I ml ol 

"^«ure is injected intrapeh.oneal.y (,P) in ,o hamsters. Each treatment Soup received 
oxin prem.xed with an. i body raised against the .ndicated protein, as ei.hcr ( | , 4X an,LSv 
uni Z^S^y*/"*?* a, " ib0dy ( °" 3 P PBI750 -2360 resin), either , 5 Zwp 
!i • uroun I infi C TB r ™'-PMB .970-2360: used undated affinity pur.lled a.Hih.uK ) o^O 
tiL group (anti-CTB. repeat spec.lic: used 1/5 diluted anti-CT8 antibody). 

The numbers in each group represent numbers of hams.ers dead or al.ve "» hr nosi-IP 
admmisiration o( toxin/antibody mixture. "~ 
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TABLE 29 



treatment Group* 


Number Animals Alive*' 


Number Animals Dead" 


Pre immune 


0 


' 5 


CTB 


5 


0 


pMB 1970-2360 


0 


5 


pMB 1-850-2360 


0 


5 


pPB 1850-2360 


o 


5 


pMB 1750-2360 (Gerbu acljl 


5 


0 



10 



15 



< \ dijjicik toxin B (CTB) (Tech Lab) at lethal concentration to hamsters is added <o antibodv 
and incubated tor one hour at 37°C. After incubation this mixture is injected intraneritoncalK 
IIP) into hamsters. Lach treatment « roup received toxin premixed uiih antibodv raised a»ainst 
(he indicated protein, as- a 4X antibody PEG prep . ' 

Hie numbers in each group represent numbers or hamsters dead or alive. 2hrs post IP 
administration oi toxin antibody mixture. 
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TABLE 30 
In Vivo Neutral izai ion Of Toxin B 




hither W.Q preparation (PRO) or affinity purified antibodies (AP> 



Z^TVr° tUV ^ lC nculralizalion < 05 no' denoics nu ncutraluauon (5 s dead) 

»l toxin IV 2 hours post-administration of mixture 

NA' denotes not applicable. 

The pl>B 1750-2360 antibody pool confers significant in vivo protection, equivalent to 
that obtained with the affinity purified CTB antibodies. This correlates with the observed high 
affinity of this antibody pool (relative to the pMB 1750-2360 or pMB 1970-2360 pools) as 
assayed by Western blot analysis (figure 24). These results prov.de the first demonstration 
that in viva neutralizing antibodies can be induced using recombinant toxin U protein as 
immunogen. 

T he failure of high concentrations of antibodies raised against the pMB I 750-2360 
protein (using Ircunds adjuvant) to neutralize, while the use of Cicrbu adjuvant and 
pMB 1750-2360 protein generates a neutralizing response, demonstrates that conformation or 
presentation of this protein is essential for (he induction of neutralizing antibodies. These 
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results are consistent with the observation that the neutralizing antibodies produced when 
native CTB is used as an immunogen appear to recognize conformational epitopes (see section 
b) above j. This is the first demonstration that the conformation or presentation of 
recombinant toxin B protein is essential to generate high titers of neutralizing antibodies. 

EXAMPLE 20 

Determination Of Quantitative And Qualitative 
Differences Between pMBl 750-2360. pMBl 750-2360 (Cierbu) 
Or pPB!750-2360 IgY Polyclonal Antibody Preparations 



In Example 19. it was demonstrated thai toxin B neutralizing antibodies could be 
generated using specific recombinant toxin B proteins (pPB 1 750-2360) or specific adjuvants 
Antibodies raised against pMBl 750-2360 were capable o I neutralizing the enterotoxin effect 
«»l toxin B when the recombinant protein was used to immunize hens in- conjunction with the 
Gerbu adjuvant, but not when I reunds adjuvant was used. To determine the basis for these 
aniiycn and adjuvant restrictions, toxin B-spcciiic antibodies present in the neutralizing and 
non-neutralizing PEG preparations were isolated by affinity chromatography and tested for 
qualitative or quantitative differences. The example involved a) purification of anti-toxin B 
specific antibodies from pMBl 750-2360 and pPBl 750-2360 PlXi preparations and b) in vivo 
20 neutralization of toxin B using the affinity purified antibody. 

a) Purification Of specific Antibodies From pMBl 750-2360 And 
pPIH750-2360 PEC; Preparations 

To purify and determine the concentration of specific antibodies (expressed as the 
25 percent of total antibody) within the pPBl 750-2360 (freunds and Gerbu) and pPBl 750-2360 
PEG preparations, defined quantities of these antibody preparations were chromatographed on 
an affinity column containing the entire toxin B repeat region (pPB 1 750-2360). The amount 
of affinity purified antibody was then quantified. 

An affinity column containing the recombinant toxin B repeat protein. pPBl 750-2360. 
was made as follows, l our ml of PBS-washed Actigel resin (Sterogene) was coupled with 5 
mg of pPB 1 750-2360 affinity purified protein (dialyzed into PBS: estimated to be greater than 
05% full length fusion protein) in a 15 ml tube (I alcon) containing 1/10 final volume Aid- 
coupling solution (IM sodium cyanoborohydride). Aliquots of the supernatant from the 



- 132 - 



WO 98/08540 PCT/US97/15394 

coupling reactions, before and after coupling, were assessed by Coomassie staining of 7.5% 
SDS-PAGE gels. Based on protein band intensities, greater than 95% (approximately 5 mg) 
of recombinant protein was coupled to the resin. The coupled resin Was poured into a 10 ml 
column (BioRad). washed extensively with PBS. pre-cluted with 4M guanidine-HCI (in 10 
mM Tris-HCl. pM 8.0: 0.005% thimerosal) and re-equilibrated in PBS and stored at 4°C. 

Aliquots of pMB 1 750-2360. pMB 1750-2360 (Gerbu) or pPB 1 750-2360 IgY polyclonal 
antibody preparations (PEG preps) were affinity purified on the above column as follows 
The column was attached to an I 'V monitor (I SCO), and washed with PBS. Forty nil aliquots 
of 2X Pf*G preps (lllter sterilized using a 0.45 \x filter and quantified by (■)[),„, before 
chromatography) was applied. The column was washed with PBS until the baseline was re- 
established (the column flow-through was saved), washed with BRSTween'to elule 
nonspeci Ileal! y binding antibodies and re-equilibrated with PBS. Bound antibodv was eluted 
from the column in 4M guanidine-HCI (in 10 mM Tris-HCL. pM 8.0. 0.005% thimerosal) and 
the entire elution peak collected in a 15 ml tube (Falcon). The column was re-equilibrated, 
and the column eluate re-chromatographed as described above. The antibody preparations 
were quantified by I IV absorbance (the elution buffer was used to zero the 
spectrophotometer). Approximately 10 fold higher concentrations of total purified antibody 
was obtained upon elution of the first chromatography pass relative to the second pass. The 
low yield from the seeond chromatography pass indicated that most of the specific antibodies 
were removed by the first round of chromatography. 

Pools of affinity purified specific antibodies were prepared by dialysis of the column 
elutes after the first column chromatography pass for the pMBI 750-2360. pMBI 750-2360 
(Gerbu > or pPB I 750-2360 IgY polyclonal antibody preparations. The elutes were collected 
on ice and immediately dialyzed against a 100-fold volume of PBS at 4°C for 2 hrs. The 
samples were then dialyzed against 3 changes of a 65-fold volume of PBS at 4°C\ Dialysis 
was performed for a minimum of 8 hrs per change of PBS. The dialyzed samples were 
collected, eenlrifuged to remove insoluble debris, quantified by OD. xo . and stored at 4°C. 

The percentage of toxin B repeat-specific antibodies present in each preparation was 
determined using the quantifications of antibody yields from the first column pass (amount of 
specific antibody recovered after first pass/total protein loaded). The yield of repeat-specific 
aifinity purified antibody (expressed as the percent of total protein in the preparation) in: 1 ) 
the pMB I 750-2360 PIIG prep was approximately 0.5%. 2) the pMB 1750-2360 (Gerbu) prep 
was approximately 2.3%. and 3) the pPB 1 750-2360 prep was approximately 0.4%. 
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Purification of a CTB ! g V polydona. antibody preparation on the same column demonstrated 
that the concentration of toxin B repeat specific antibodies in the CTB preparation was 0.3So/„ 

These results demonstrate that I) the use of Gerbu adjuvant enhanced the titer of 
specillc antibody produced against the pMB 1750-2360 protein 5-fold relative to immunization 
ustng Freunds adjuvant, and 2> the differences seen in the ,„ Wvv, neutralization abi.itv of the 
PMBI 750-2360 (no, neutralizing) and pPB 1 750-2360 (neutralizing) and CTB (neutralizing 
PEG preps seen in Example 19 was no, due ,o differences in .he titers of repeal-specific ~ 
antibodies in the three preparations because the titer of repeat-specific antibodv was similar 
lor all three preps: therefore the differing ability of the three antibodv preparations to 
neutralize toxin B must reflect qualitative differences in the induced toxin B repeat-specillc 
ant.bod.es. To confirm that qualitative differences exis, between antibodies raised in hens 
immunized with different recombinant pr„,eins and/or different adjuvants, the same amoun, of 
altumy punHed ami-toxin B repeat faa 1870-2360 of toxin B> antibodies from the differen, 
preparations was administered to hamsters using the /„ viva haiuster model" as described 
below. 

10 In vivo Neutralization Of Toxin B Using Affinity Purified 
Antibody 

The in vivo hamster model was utilized to assess the neutralizing ability of the affinity 
purilled antibodies raised aga.ns, recombinant toxin B proteins purilicd in (a, above. As well, 
a 4X IgY PEG preparation from a second independent immunization utilizing the P I'BI750- 
2360 antigen wi,h Freunds adjuvant was ,es,ed for in viva neutralization. The results are 
shown in fable 31. 

The results shown in 'Tabic 31 demonstrate that: 
I ) as shown in Example 19 and reproduced here. 1.5 mg of affinity purified 

antibody from pMB I 750-2360 immunized hens using f reunds adjuvant does 

neutralize toxin B in viva. However. 300 M g u f affini.s puri lied antibody 
from similarly immunized hens utilizing Gerbu adjuvant ■demonstrated complete 
neutralization of toxin U in viva. This demonstrates that (ierbu adjuvant, in 
addition to enhancing the titer of antibodies reactive ,o the pMBl 750-2360 
antigen relative to Freunds adjuvant (demonstrated in (a) above), also enhances 
the yield of neutralizing antibodies to this antigen, greater than 5 fold. 
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► Complete in vivo neutralization of toxin B was observed with 1 .5 mg of 

affinity purified antibody from hens immunized with pPBI 750-2360 antigen, 
but not with pMB 1 750-2360 antigen, when Frcunds adjuvant 'was used This 
demonstrates, using standardized toxin B repeat-spec i lie antibody 
concentrations, that neutralizing antibodies were induced when pPB 1750-2360 
but not pMB I 750-2360 was used as the antigen with Frcunds adjuvant. 
C omplete in vivo neutralization was observed with 300 ng of pMBf 750-2360 
(Gerbu) antibody, but not with 300 M g of pPBI 750-2360 (Frcunds) antibody. 
Thus the pMBI 750-2360 (Gerbu) antibody has a higher titer of neutralizing 
antibodies than the pPB 1750-2360 (Frcunds) antibody. 
C omplete neutralization of toxin B was observed using 300 j.ig of CTB 
antibody (affinity purified (AP)| but not 100 w CTB antibody (AP or PEG- 
prep). This demonstrates that greater than 100 M g of toxin B repeat-specific 
antibody (anti-CTB) is necessary to neutralize 25 jag toxin B in vivo in this 
assay, and that affinity purified antibodies specific to the toxin B repeal interval 
neutralize toxin B as effectively as the PFP prep of lgY raised against the 
entire CTB protein (shown in this assay). 

As was observed with the initial pPB 1 750-2360 (IgY) PliG preparation 
(Fxample 1<>). complete neutralization was observed with a IgY PHG 
preparation isolated from a second independent group of pPBI 750-2360 
(Frcunds) immunized hens. This demonstrates that neutralizing antibodies are 
reproducibly produced when hens are immunized with pPB 1750-2360 protein 
utilizing Freunds adjuvant. 
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TABLE 31 



- Treatment Croup 4 


Number Animals Alive" 


Number Animals Dead 1 " 


Preimmune' 


0 


5 


CTB (300 ng) : 


5 


0 


CTB MOO mu) : 


1 


4 


pMU 1750-2360 (G) (5 mu) J 


5 


0 


pMB!750-236O <G)<I.5 muf 


5 


0 


pMB 1750-2360.(0) (300 u»r 


5 


0 


pMBI750-2j6O (F) (1.5 m«) : 


0 


5 


pPB 1750-2360 (1) (1.5 m») : 


5 


<l 


pPB 1750-2360 <F) (100 M g>- 


1 


-4 


t*TB (100 ni!) 


■> 




PPBI750-2360 (F) (500 u»y 


5 


0 



Ih 



...u ..mmouy (amount ot specific antibody is indicated) and incubated for one hour at ;7T 
Alter incubation, this mixture was injected IP into hamsters ( I 5 total mix injected per 
hamster ». Fach treatment group received toxin premixed with .amihodx raised against the 
! ,K v C .H^ PmlCin (G " crbu ad i uv ™' F-Freunds adjuvant I. ! indicates ihe antibodv was a 4X 
lt:Y I I.C, prep:.- indicates the antibody was affinity purified on a pPB i K50-"» 1(>0 resin ind 
indicates that the antibody was a IX l»Y PFG prep. 

The numbers in each uroup represent numbers ol' hamsters dead <»r alive. 2 hrs post IP 
administration ol* toxmantibodv mixture. 



EXAMPLE 21 

» 

Diagnostic Knzyme Immunoassays l or C difficile Toxins A And U 

I he ability of the recombinant toxin proteins and antibodies raised against these 
recombinant proteins (described in the above examples) to form the basis of diagnostic assays 
for the detection of clostridial toxin in a sample was examined Two immunoassay formats 
were tested to quantitatively detect ( difficile toxin A and toxin H from a biological 
specimen. I he first formal involved a competitive assay in which a fixed amount of 
recombinant toxin A or B was immobilized on a solid support (e.» . microliter plate wells) 
followed by the addition ol a toxin-conlaining biological specimen mixed with affinity- 
purified or l»KG fractionated antibodies against recombinant toxin A or li. If toxin is present 
in a specimen, this toxin will compete with the immobilized recombinant toxin protein for 
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binding to the anti^recombinam antibody thereby reducing the signal obtained following the 
addition of a reporter reagent. The reporter reagent detects the presence of antibody bound to 
, the immobilized toxin protein. 

In the second format, a sandwich immunoassay was developed using affinity-purified 
antibodies to recombinant toxin A and B. The affinity-purified antibodies to recombinant 
toxin A and B were used to coat microliter wells instead of the recombinant polypeptides (as 
was clone in the competitive assay format). Biological samples containing toxin A or B were 
.hen added to the wells followed by the addition of a reporter reagent to detect the presence 
of bound toxin in the well. 

a) Competitive Immunoassay For The Detection Of C difficile 
Toxin 

Recombinant toxin A or B was attached to a solid support by coating 96 well 
microliter plates with the toxin protein at a concentration of lug/ml in PBS. The plates were 
incubated overnight at 2-8°C. The following- morning; the coating solutions were removed 
and ihe remaining protein binding sites on the wells were blocked by filling each well with a 
l»BS. solution containing 0.5% BSA and 0.05% Tween-2o! Native < . difficile toxin A or B 
( l ech Lab) was diluted to 4 ug/ml in stool extracts from healthy Syrian hamsters (Sasco). 
The siool extracts were made by placing fecal pellets in a 15 ml centrifuge tube: PBS was 
added at 2 ml/pellet and the tube was voriexed to create a uniform suspension. The tube was 
then centrifuged ai 2000 rpm for 5 min at room temperature. The supernatant was removed: 
this comprises ihe siool extract, fifty ul of the hamster stool extract was pipetted into each 
well of the microliter plates to serve as the diluent for serial dilutions of the 4 ug/ml toxin 
samples. One hundred ul of the toxin samples a. 4 ug/ml was pipetted into the first row of 
wells in the microliter plate, and 50 ul aliquots were removed and diluted serially down the 
plate in duplicate. An equal volume of affinity purified anti-recombinant loxin antibodies 1 1 
ng/well ol" ami-pMA I X70-2680 antibody was used for the detection of toxin A: 0.5 ng/well of 
anii- P MBI750-2360((ierbu) was used lor the detection of toxin B| were added to appropriate 
wells, and ihe plates were incubated at room temperature for 2 hours with gentle agitation. 
Wells serving as negative control contained antibody but no native toxin to compete for 
binding. 

Unbound toxin and antibody were removed by washing the plates 3 to 5 times with 
PBS containing 0.05% Tween-20. Following the wash step. 100 M | of rabbit anti-chicken IgG 
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antibody conjugated to alkaline phosphatase (Sigma, was added to each well and the plates 
were incubated for 2 hours at room temperature. The plates were then washed as before to 
remove unbound secondary antibody. Freshly prepared alkaline phosphatase substrate (1 
mg/ml p-nitrophenyl phosphate (Sigma) in 50 mM Na : CO : . pH 9.5: 10 mM MgCK) was 
added to each well. Once sufficient color developed, the plates were read on a Dynatech 
MR700 microliter plate reader using a 410 nm tiller. 

The results are summarized in Tables 32 and 33. I or the results shown in Table 32. 
the wells were coated with recombinant toxin A protein (pMA 1 870-2680). The amount of 
native toxin A added (present as an addition to solubilized hamster stool, to a given well is 
indicated (0 to 200 ng>. Antibody raised against the recombinant loxin A protein. P MAI870- 
2680. was affinity purified on the an affinity column containing pPA 1 870-2680 (described in 
Lxample 20). As shown in Table 32. the recombinant toxin A protein and affinity-purified 
antitoxin can be used for the basis of a competitive immunoassay tor the detection of toxin A 
in biological samples. 

Similar results were obtained using the recombinant toxin B. pl*B 1 750-2360. and 
antibodies raised against pMBI 750-2360(Gerbu>. For the results shown m Table 33. the 
wells were coated with recombinant loxin B protein (pPBI750-236(». The amount of native 
loxin B added (present as an addition to solubilized hamster stool) to a given well is indicated 
(0 to 200 ng). Antibody raised against the recombinant toxin B protein. p.MBI750- 
2360((ierbu>. was affinity purified on the an affinity column containing pPB 1 850-2360 
(described in Lxample 20). As shown in f able 33. the recombinant toxin B protein and 
affinity-purified antitoxin can be used for the basis of a competitive immunoassay tor the 
detection of toxin B in biological samples. 

In this competition assay, the reduction is considered significant over the background 
levels at all points: therefore the assay can be used to delect samples containing less than 12.5 
ng toxin A/well and as little as 50-100 ng toxin B/well. 
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TABLE 32 

'Competitive Inhibition Of Anti-C difficile Toxin A By Native Toxin A 



10 



15 



20 



Toxin A/ Well 


: j _ _ 

OD,,„ Readout 




0.176 


100 


0.253 


50 


0.240 


25 


0.259 


12.5 


0.309 


6.25 


0.367 


.1.125 


0.417 


0 


0.590 


TABLE 33 

Competitive Inhibition Of Anti-C. difficile Toxin B Bv Native Toxin B 


n&i' Toxin B/Well 


OD„„ Readout 


200 


0.392 


100 


0.566 


r >o 


0.607 


25 


0.778 


12.5 


0970 


6.25 


0.902 


3.125 


L040 


0 


1.055 



I hese competitive inhibition assays demonstrate that native C. difficile toxins and 
recombinant ( difficile toxin proteins can compete for binding to antibodies raised against 
recombinant C . difficile toxins demonstrating that these anti-recombinanl toxin antibodies 
provide effective diagnostic reagents. 

b) Sandwich Immunoassay For The Detection Of C difficile 
Toxin 

Affinity-purified antibodies against recombinant toxin A or toxin B were immobilized 
to % well microliter plates as follows. The wells were passively coaled overnight at 4°C 
with affinity purified antibodies raised against either pMA I 870-26X0 <toxin A) or pMBl 750- 
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2360(Gerbu> (toxin B). The antibodies were affinity purified as dcscribcd in ExampJe 1Q 
Tl* anttbodies were used at a concentration of I ug/ml and , 00 u , was added to each 
m,crot,«er well. The wells were then blocked with 200 ul «, 0 5% BSA in PBS tor ^ hours 
at roon, temperature and the blocking solution was then decanted. Stoo, samples from hcalthv 
Synan hamsters were resuspended in PBS, pi J 7.4 (2 ml PBS/s(ool pellet was used to 
resuspend the pellets and the sample was cemrifuged as described above, The stool 
suspension was then spiked with native C. J^cUe toxin A or B (Tech Lab, a, 4 Mg / m , T he 
stool suspensions containing toxin (either toxin A or toxin B, were then seria.lv diluted two- 
fold m stool suspension without toxin and 50 ul was added in duplicate to the coated 
m-crotiter well, Wells containing stool suspension without toxin served as the negative 
control. 

, The plates were incubated for 2 hours at room temperature and then were washed 
inn* times with PBS. One hundred u. of either goat anti-native toxin A or ,oat ami-native 
•oxm B , I cch l.ab, diluted 1 : 1 000 in PBS contaming I % BSA and 0.05% I ween ^0 was 
added to each well . The plates were incubated for another 2 hours a, room temperature 
I he plates uere then washed as before and 100 p. of alkaline phosphatase-conju.ated rabbit 
ant.-goat IgG (C appel. Durham. N.C.) was added at a dilution of ,| : |000. The plates were 
mcubated for another 2 hours a, room temperature. The plates were washed as before then 
developed by the addition of 1 00 ul/well of a substrate solution containing . miI /ml p . 
nuropheny. phosphate (Sigma, in 50 mM Na.CO, P H 0.5: .0 „,M MgC. The" absorbanee of 
each well was measured using a plate reader (Hynatechl at 410 nm. The assav results are 
shown in 'fables 34 and 35. 

t 

TABLE 34 



7oxin A/Well 


".."...vruiuicu /vminouius Against Toxin A 
OD l|(( Readout 


200 


0.9 


100 


0.8 


50 


0.73 , 


25 


0.71 


12.5 


0,59 


6.25 


0.421 


0 


0 
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TABLE 35 



n« Toxin B/Well 


OD, lo Readout 


200 


1.2 


100 


0.973 


50 


0.887 


25 


0.846 


12.5 


0.651 


6.25 


0.431 


0 


0.004 



10 



15 



20 



The results shown in Tables 34 and 35 show that antibodies raised against recombinant 
toxin A and toxin B fragments can be used to detect the presence of ( . difficile toxin in stool 
samples. These antibodies form the basis tor a sensitive sandwich immunoassay which. is 
capable of detecting as little as (>.25 ng of either toxin A or B in a 50 ul stool sample. As 
shown above in Tables 34 and 35. the background lor this sandwich immunoassay is 
extremely low: therefore, the sensitivity of this assay is much lower than 6,25 ng toxin/well. 
It is likely that toxin levels of 0.5 to 1.0 pg/well could be detected by this assay. 

The results shown above in Tables 32-35 demonstrate clear utility of the recombinant 
reagents in ( '. difficile, toxin detection systems. 



EXAMPLE 22 

C onstruction And Expression Of C hntulinum C Fragment f usion Proteins 



30 



The C. hoiulinwu type A neurotoxin gene has been cloned and sequenced (Thompson. 
« ul... Eur. J. Biochem. 189:73 ( I990)|. The nucleotide sequence of the toxin gene is 
available from the EMBL/GenBank sequence data banks under the accession number X52066: 
the nucleotide sequence of the coding region is listed in SCO ID NO:27. The amino acid 
sequence of the C. hoiulinum type A neurotoxin is listed in SEQ ID NO:2X. The type A 
neurotoxin gene is synthesized as a single polypeptide chain which is processed to form a 
dimer composed of a light and a heavy chain linked via disulfide bonds. The 50 kD earboxy- 
terminal portion of the heavy chain is referred to as the C fragment or the II, domain. 

Previous attempts by others to express polypeptides comprising the C fragment off. 
hoiulinum type A toxin as a native polypeptide U:f>.. not as a fusion protein) in £. coli have 
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been unsuccessful [H.F. LaPenotiere. et al. in Botulinum and Tetanus' Neurotoxins. DasGupta. 
Ed.. Plenum Press. New York (1993). pp. 463-466]. Expression of the C fragment as a 
fusion with the £. col'i MBP was reported to result in the production ofiinsoluble protein 
(H.F. LaPenotiere. et al.. supra). 

In order to produce soluble recombinant C fragment proteins in E. coli. fusion proteins 
comprising a synthetic C fragment gene derived from the C. hotulinum type A toxin and 
either a portion of the ( : difficile toxin protein or the MBP were constructed. This example 
involved a) the construction of plasmids encoding C fragment fusion proteins and b) 
expression of ( *. hotulinum C fragment fusion proteins in E. coli. 

a) Construction Of Plasmids Encoding C Fragment Fusion 
Proteins 

In F.xample I I. it was demonstrated that the ( '.' difficile toxin A repeat domain can be 
efficiently expressed and purified in E. coli as either native (expressed in the pliT 23a vector 
in clone pl>A 1870-2680) or fusion (expressed in the pMALc vector as a fusion with the E 
coli MBP in clone P MA1870-2680) proteins, fusion proteins comprising a fusion between 
the MBP. portions of the C difficile toxin A repeat domain (shown to be expressed as a 
soluble fusion protein) and the C fragment of the C. hotulinum type A toxin were constructed. 
A fusion protein comprising the C fragment of the C hotulinum type A toxin and the MBP 
was also constructed. 

f igure 25 provides a schematic representation of the botulinal fusion proteins along 
with the donor constructs containing the ( difficile toxin A sequences or < •. hotulinum V 
fragment sequences which were used to generate the botulinal fusion proteins. In Figure 25. 
the solid boxes represent C difficile toxin A gene sequences, the open boxes represent ( 
hotulinum C fragment sequences and the solid black ovals represent the coli MBP. When 
the name for a restriction enzyme appears inside parenthesis, this indicates that the restriction 
sue was destroyed during construction. An asterisk appearing with the name for a restriction 
enzyme indicates that this restriction site was recreated at the cloning junction. 

In Figure 25. a restriction map of the P MA1870-2680 and pPAl 100-2680 constructs 
(described in Example 11) which contain sequences derived from the C difficile toxin A 
repeat domain are shown: these constructs were used as the source off. difficile toxin A gene 
sequences for the construction of plasmids encoding fusions between the ( . hotulinum C 
fragment gene and the C. difficile toxin A gene. The pMA 1870-2680 express.on construct 
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expresses high levels of soluble, intact fusion protein (20 mg/liter culture) which can be 
affinity purified on an amylose column (purification described in Example I Id). 

The pAlterBot construct (Figure 25) was used as the source of C. hoiulinum C 
fragment gene sequences for the botulinal fusion proteins. pAlterBot was obtained from J. 
Middlebrook and R. Lemley at the U.S. Department of Defense. pAlterBot contains a 
synthetic C. hoiulinum C fragment inserted in to the pALTER-K",' vector (Promcga). This 
synthetic C fragment gene encodes the same amino acids as does the naturally occurring C 
fragment gene. The naturally occurring C fragment sequences, like most clostridial genes, are 
extremely AT rich ( Thompson ci til., supra). This high A/T content creates expression 
difficulties in tW/ and yeast due to altered codon usage frequency and fortuitous 
polyaden\ lation sites! respectively. In order to improve the expression ore fragment proteins 
. in /; caii . a synthetic version of the gene was created in which the non-preferred codons were 
replaced with preferred codons. 

The nucleotide sequence of the ( '. hoiulinum C fragment gene sequences contained 
within pAlterBot is listed in SEQ ID NO:22. The first six nucleotides ( ATGGCT) encode a 
methionine and alanine residue, respectively. These two amino acids result from the insertion 
of the C hoiulinum C fragment sequences into the pALTER'K) vector and provide the initiator 
methionine residue. The amino acid sequence of the C. hoiulinum C fragment encoded by the 
sequences contained within pAlterBot is listed in SEQ ID NO:23. I he first two amino acids 
(Mel Ala) are encoded by vector-derived sequences. From the third amino acid residue 
onward (Argl. the amino acid sequence is identical to that found in the C. hoiulinum type A 
toxin gene. 

The pfvlA 1 870-2680. pPA 1 1 00-2680 and pAlterBot constructs were used as progenitor 
plasmids to make expression constructs in which fragments of the ( difficile toxin A repeat 
domain were expressed as genetic fusions with the C. hoiulinum C fragment gene using the 
pMAl.-c expression vector (New England BioEabs). The pMAE-c expression vector 
generates fusion proteins which contain the MBP at the ammo-terminal end of the protein. A 
construct. pMBot. in which the < hoiulinum C fragment gene xvas expressed as a fusion with 
only the MBP was constructed (Figure 25). Fusion protein expression was induced from E. 
coli strains harboring the above plasmids. and induced protein was affinity purified on an 
amylose resin column. 
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i) Construction Of pBlueBot 

In-order to facilitate the cloning of the C. boiulim,,,, C fragment gene sequences into a 
number of desired constructs, the botulinal gene sequences u^re removed from pAltcrBot and 
were inserted into the pBiuescript plasmid (Stratagene) to generate pBlueBot (f igure 25). 
pBlueBot was constructed as follows. Bacteria containing the pAltcrBot plasmid were grown 
in medium containing tetracycline and plasmid DNA was isolated using the Q| Aprep-spin 
Plasmid Kit (Qiagen). One microgram of pAltcrBot DNA was digested with \col and the 
resulting .V recessed sticky end was made blunt using the Klenow fragment of DNA 
polymerase I (here after the Klenow fragment). The pAlterBot DNA was then digested with 
///Will to release the botulinal gene sequences (the Bot insert) as a blunt (filled Xcol site)- 
"/Will fragment, pliluescripr vector DNA was prepared by digesting 200 ng of pBiuescript 
DNA with.V„,«| and ///Will. The digestion products from both plasmids were resolved on an 
agarose gel. The appropriate f ragments were removed from the gel. mixed and purified, 
utilizing the Prep-a-Genc kit (UioRad). The eluted DNA was then liyaled using 14 DNA 
ligase and used to transform competent DH5« cells (Ciibco-BRI). Host cells were made 
competent for transformation using the calcium chloride protocol of Sambrook e, «/.. supra at 
1. 82- 1. XV Recombinant clones were isolated and confirmed by restriction digestion using 
standard recombinant molecular biology techniques (Sambrook el al. supra). The resultant 
clone. pBlueBot. contains several useful unique restriction sites Hanking the Bot insert (i.e.. 
the C hondimm, C fragment sequences derived from pAlterBot) as shown- in f igure 25. 

ii) Construction Of C. difficile / C. botulinum I 
MBP Fusion Proteins 

C onstructs encoding fusions between the C. difficile toxin A gene and the C \ bond mum 
C fragment gene and the MBP were made utilizing the same recombinant DNA methodology 
outlined above: these fusion proteins contained varying amounts of the ( difficile toxin A 
repeat domain. 

The pMABot clone contains a 2.4 kb insert derived from the ( d,ffkile toxin A gene 
fused to the Bot insert (U. the C. hon,lim,m C fragment sequences derived from pAlterBot). 
pMABot (figure 25) was constructed by mixing gel-purified DNA from .WI//V,W|[| digested 
pBlueBot (the 1.2 kb Bot fragment). SpeWNoll digested pPAl 100-2680 (the 2.4 kb C. difficile 
toxin A repeat fragment) and A7,<,I///,W1U digested pMAI.-c vector., Recombinant clones 
were isolated, confirmed by restriction digestion and purified using the Ql Aprep-spin Plasmid 
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Kit (Qiagen). This clone expresses the toxin A repeats and the botulinal C fragment protein 
sequences as an in-frame fusion with the MBP. 

The pMCABot construct contains a 1.0 kb insert derived from 1 the C. difficile toxin A 
gene fused to the Bot insert (i.e. the C halulinum C fragment sequences derived from 
pAlterBot). pMCABot was constructed by digesting the pMABot clone with AVoRI to 
remove the 5" end of the C difficile toxin A repeat (see Figure 25 the pMAl.-c vector 
contains a EcoRl site 5" to the C. difficile insert in the pMABot clone)., The restriction sites 
were tilled and rcligated together alter gel purification. The resultant clone (pMCABot. 
Figure 25) generated an in-frame fusion between the MBP and the remaining .V portion of the 
( ". difficile toxin A repeat domain fused to the Bot gene. 

The pMNABot clone contains the 1 kb S/k>UEcoRI Milled) fragment from the C V 
difficile toxin A repeat domain (derived from clone pPAl 100-2680) and the 1.2 kh ( ' 
hnhdimm, C fragment gene as a Xcol (filled)//////^ II fragment (derived from pAlterBot). 
.These two fragments were inserted into the pMAL-c vector digested with XhuVHindUX. The 
two insert fragments were generated by digestion of the appropriate plasmid with Ec<»R\ 
(pPAl 100-2680) or \cn\ (pAlterBot) followed by treatment with the Klenow fragment. After 
ireatmcm with the Klenow fragment, the plasmids were digested with the second enzyme 
(either Sfwl oi ///WIN). All three fragments were gel purified, mixed and Prep-n-Genc 
purified prior to ligation. Following ligation and transformation, putative recombinants were 
.■analyzed by restriction analysis: the EcoR\ site was found to be regenerated at the fusion 
junction, us was predicted for a fusion between the filled EcuRX and Nco\ sites. 

A construct encoding a fusion protein between the botulinal C fragment gene and the 
MBP gene was constructed (i.e.. this fusion lacks any C difficile toxin A gene sequences) and 
termed pMBot. The pMBot construct was made by removal of the C. difficile toxin A 
sequences from the pMABot construct and fusing the C fragment gene sequences to the MBP. 
This was accomplished by digestion of pMABot DNA with .SV//I (located in the pMALc 
polylinkcr 5" to the Xha\ site) and Xba\ (located .V to the Awl site at the toxA-Bot fusion 
junction), filling in the Xhal site using the Klenow fragment, gel purifying the desired 
restriction fragment, and ligating the blunt ends to circularize the plasmid. Following ligation 
and transformation, putative recombinants were analyzed by restriction mapping of the Bot 
insert (i.e. the (' hoiulinum C fragment sequences). 



- 145 - 



10 



15 



20 



2^ 



WO 98/08540 PCIYUS97/1S394 

b) Expression Of C. botulinum C Fragment Fusion Proteins In 
E. colt 

Large scale ( 1 liter) cultures of the pMAL-c vector, and each recombinant construct 
described above in (a) were grown, induced, and soluble protein fractions were isolated as 
described in Example 18. The soluble protein extracts were chromatographed on amylosc 
affinity columns to isolate recombinant fusion protein. The purified recombinant fusion 
proteins were analyzed by running samples on SDS-PAGE gels followed by Coomassie 
staining and by Western blot analysis as described [Williams e, «/. (1994) .„,,„<,]. In brief, 
extracts were prepared and chromatographed in column buffer ( 10 n,M Nal>0 4 . 0.5 M NaCI. 
10 mM P-mercaptocthanol. pH 7.2) over an amvlose resin (New England Biolabs) column, 
and eluted with column buffer containing 10 mM maltose as described | Williams, cl at. 
(IW4). .supra\. An SDS-PACiF. gel containing the purified protein samples stained with 
Coomassie blue is shown in Figure 26. 

In Figure 26. the following samples were loaded. Lanes 1-6 contain protein purified 
from li. cW, containing the pMAL-c. pPA 1870-2680. pMABol. pMNABot. pMCABot and 
pMBot plasmids. ; respectively. Lane 7 contains broad range molecular weight protein markers 
(BiuRad). 

The protein samples were prepared for electrophoresis by mixing 5 ul of eluted protein 
with 5 M l of 2X SDS-I'AGE sample buffer (0.125 mM Tris-HCI. pM 6.8. 2 mM EDTA. 6% 
SDS. 20% glycerol. 0.025% bromophenol blue: (J-mercaptoethanol is added to 5% before 
use). The samples were heated to 95°C for 5 min. then cooled and loaded on a 7.5% agarose 
SDS-PACili gel. Broad range molecular weight protein markers were also loaded to allow 
estimation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected generally by staining the gel with Coomassie blue. 

In all cases the yields were in excess of 20 nig fusion protein per liter culture (see 
Table 36) and. with the exception of the pMCABot protein, a high percentage (i.e.. greater 
than 20-50% of total eluted protein) of the eluted fusion protein was of a MW predicted for 
the full length fusion protein (f igure 26). It was estimated (In visual inspection) that less 
than 10% of the pMCABot fusion protein was expressed as .the full length fusion protein. 
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TABLE 36 



10 



15 



20 



Construct 


Yield (mg/liter of Culture) 


Percentage Of Total 
Soluble Protein 


pMAHoi 


24 


5.0 


pMCABot 


34 


5.0 


pMNABoi 


40 


5.5 


pMBot 


22 


5.0 


PMAIK70-2680 


40 


48 



These results demonstrate that high level expression of intact ('. hamlinum C 
iraumcnt/C difficile toxin A fusion proteins in E. ca/lis feasible using the pMAL-c 
expression system. These results are in contrast to those reported by H. F. LaPcnoticrc. el aL 
(l c W). supra. In addition, these results show that it is not necessary u> fuse the botulinal C 
fragment gene to the C. difficile toxin A gene in order to produce a soluble fusion protein 
using the pMAI.-c system in E. colL 

In order to determine whether the above-described botulinal fusion proteins were 
recognized by anti-C hutulinum toxin A antibodies. Western blots were performed. Samples 
containing affinity-purified proteins from./:, edit containing the pMABot. "pMCABot. 
pMNABot. pMBot. pMA 1 870-2680 or pMALc plasmids were analyzed. SDS-PAGE gels 
(7.5% acrylamide) were loaded with protein samples purified from each expression construct. 
Alter electrophoresis, the gels were blotted and protein transfer was confirmed by Ponceau S 
staining (as described in Example 12b). 

Following protein transfer, the blots were blocked by incubation for 1 hr at 20°C in 
blocking buffer (PBST (PBS containing 0.1% Tvveen 20 and 5% dry milk)|. The blots were 
then incubated in 10 mi of a solution containing the primary antibody; this solution comprised 
a 1/500 dilution of an anti-C. hamlmum toxin A IgY PEG prep (described in Example 3) in 
blocking buffer. The blots were incubated for I hr at room temperature in the presence of the 
primary antibody. The blots were washed and developed using a rabbit anti-chicken alkaline 
phosphatase conjugate (Boehringer Mannheim) as the secondary antibody as follows. Hie 
rabbit anti-chicken antibody was diluted to I ug/mi in blocking buffer (10 ml final volume 
per blot) and the blots were incubated at room temperature for I hour in the presence of the 
secondary antibody. The blots were then washed successively with PBS T. BBS-Tween and 
50 mM Na,CO v pH 9.5. The blots were then developed in freshly -prepared alkaline 
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phosphatase substrate buffer (100 M g/ml nitro blue tctrazolium. 50 ug/ml 5-bromo-chloro- 
indoly.phosphate. 5 ,„M MgCl, in 50 mM Na : CO,. pH 9.5). Development was stopped bv 
flooding the blots with distilled water and the blots were air dried. i 

This Western blot analysis detected anti-C. W™,m toxin reactive proteins in the 

pMABot. pMCABot. pMNABot and pMBot protein samples (corresponding to the predicted 

full length proteins identilied above by Coomasste staining in Figure 26). but not in the 

pMAI 100-2680 or pMALc protein samples. 

These results demonstrate that the relevant fusion proteins purified" on an amvlose resin 

as described above in section a) contained immunorcactive C ho,ul,nu,„ C fragment protein as 

predicted. 



EXAMPLE 23 

Generation Of Neutralizing Antibodies 
By Nasal Administration Of pMBot Protein 

The ability of the recombinant botulinal toxin proteins produced in Lxample 22 to 
stimulate a systemic immune response against botulinal toxin epitopes was assessed This 
example involved: a) the evaluation of the induction of serum IgCi titers produced by nasal or 
oral administration of botulinal toxin-containing C V difficile toxin A fusion proteins and b) 
the in vivo neutralization of C •. /,„,„/,„„„, tvpc A IKWOIOxin bv anti . rccombinam ( • 
hoiulhwni C fragment antibodies. 

a) Evaluation Of The Induction Of Serum IgC; Titers Produced 
By Nasal Or Oral Administration Of Botulinal Toxin- 
Conlaining C. difficile Toxin A Fusion Proteins 

Six groups containing five 6 week old CF female rats (Charles River) per group were 
immunized nasally or orally with one of the following three combinations us.ng protein 
prepared in Hxamplc 22: < I) 250 ug pMBot protein per rat (nasal and oral); 2) 250 ug 
pMAIiot protein per rat (nasal and oral): 3) 125 ug pMBot admixed with 125 M g P MAI870- 
2680 per rat (nasal and oral). A second set of 5 groups containing 3 CF female rats/group 
were immunized nasally or orally with one of the following combinations (4) 250 ug 
pMNABot protein per rat (nasal and oral) or 5) 250 »g pMAI.-c protein per rat (nasal and 
oral). 
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The fusion proteins were prepared for immunization as follows. The proteins (in 
column buffer containing 10 mM nialtpse) were diluted in 0.1 M carbonate buffer. pH 9.5 and 
administered orally or nasally in a 200 ul volume. The rats were ligWly sedated with ether 
prior to administration. The oral dosing was accomplished using □ 20 gauge feeding needle. 
The nasal dosing was performed using a P-200 micro-pipettor (Gilson). The rats were 
boosted 14 days after the primary immunization using the techniques described above and 
were bled 7 days later. Rats from each group were lightly etherized and bled from the tail 
The blood was allowed to clot at 37°C for I hr and the serum was collected. 

The serum from individual rats was analyzed using an F.LISA to determine the anti-C 
W/„,„„ type A toxin IgG serum titer. The ELISA protocol used is a modification of that 
described in Example I3c. Briefly. 96-well microtiter plates (Falcon. Pro-Bind Assay Plates) 
were coated with C. hmulinum type A toxoid (prepared as described in -Example 3a) by 
placing 100 M | volumes of ( . W,,„„„ , yp e A toxoid at 2.5 M g/ml in PBS containing 
0.005% ,himerosal in eachwell and incubating overnight at 4°C. The next morning, the 
coating suspensions were decanted and all wells were washed three times using PBS. 

In order to block non-specific binding sites. 100 M l of blocking solution [0.5% BSA in 
l'BS| was then added to each well and the plates were incubated for I hr at 37°C.\ The 
Mocking solution was decanted and duplicate samples of 150 ul of diluted rat serum added to 
the first well of a dilution series. The initial testing serum dilution was 1:30 in blockinu 
solution containing 0.5% Tween 20 followed by 5-fold dilutions into this solution. TlJwas 
accomplished by serially transferring 30 ul aliquots to 120 ul blocking solution containing 
0.5% Tween 20. mixing, and repealing the dilution into a fresh well. After the final dilution. 
30 M l was removed from the well such that all wells contained 120 ul final volume. A total 
of 3 such dilutions were performed (4 wells total). The plates were incubated 1 hr at 37°C. 
f ollowing this incubation, the serially diluted samples were decanted and the wells were 
washed six times using PBS containing 0.5% Tween 20 (PBST). To each well. 100 ul of a 
rabbit anti-Rat IgG alkaline phosphatase (Sigma) diluted (l/l 000) in blocking buffer 
containing 0.5% Tween 20 was added and the plate was incubated for I hr at 37»C. The 
conjugate solutions were decanted and the plates were washed as described above, substituting 
50 mM NaX-O, pH 9.5 for the PBST in the final wash. The plates were developed by the " 
addition of 100 ul of a solution containing I mg/ml para-nitro phenyl phosphate (Sigma) 
dissolved in 50 mM Na : CO v 10 ,„M MgCI, pH 9.5 to each well, and incubating the plates a. 
room temperature in the dark for 5-45 min. The absorbency of each well was measured at 
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410 nm using a Dynatech MR 700 plate reader. The results are summarized in Tables 37 and 
38 and represent mean serum reactivities of individual mice. 



TABLE 37 

!KMcmiiiimio« Of Anii-T htm,imum Type A limn Scram led Titers I olkmin- |. 



immunization With C. Mr«/«mm ( | ragmcnK ««iUimng 





imunt/.atinn 




Nasal 


Oral 




IMMUNF 


pMlkit 


pMttot & 
pM A 1X7(1- 
2ft80 


pM Allot 


pMllot 


pMlkMA 
pMAI870- 
26K0 


pMAUfil 


Dilution 












1 M) 


(1 0K0 




1 030 


0 OfiO 


0 i<H\ 


0 0K0 


0 120 


1 15(1 


(Mil? 


*i.5«n 


O 540 


0 022 


0 070 


0.020 


0 027 


1 750 


O.IMW 


O.2X0 


0.260 


0010 


0 020 


0 010 


0 014 


1:3750 


(MK>7 


OOK4 


CM M0 


0 (HW 


(MMH) 


0 010 


0 007 


' " Kai* Tested 




> 








"> 


y 



\»mbcrs represent die averse value* obtained Iron, mo IIINA plmev Mandard^cd ui 



idi/in-i d»c prcimmtniL- amtrol 



TABLE 38 

Determination OfAnti-C. hotulmum Type A Toxin Serum l»G Tilers 



Rome of In 


imunization 


Nasal 


Oral 


Immuno^cn 


PRE-IMMUNE 


pMBot 


| pMABot 


pMNABot 


pMNAHot 


Dilution 








1:30 


0.040 


0.557 


0 010 


0.015 


0.010 


I T 50 


0.009 


0.383 


0.001 


0.003 


0 002 


1:750 


0.00 1 


0.140 


0.000 


0.000 


0.000 


1:3750 


0.000 


0.040 


0.000 


0.000 


0.000 


« Rats Tested 




1 


1 


3 


3 



The above UUSA results demonstrate that reactivity against the bolulinal Fusion 
proteins was strongest when the route of administration was nasal: only weak responses were 
stimulated when the botulinal fusion proteins were given oralis. Nasally delivered pMbot and 
pMBot admixed with pMA 1 870-2680 invoked the greatest serum IgG response. These results 
show that only the pMBot protein is necessary to induce this response, since the addition of 
the pMA 1 870-2680 protein did not enhance antibody response (Table ?>7). Placement of ihe 
C. difficile toxin A fragment between the MBP and the C. haiulimtm C fragment protein 
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dramatically reduced anti-bot IgG titer (see results using pMABot. pMCABot and pMNABot 
proteins). 

This study demonstrates that the pMBot protein induces a strong serum IgG response 
directed against C. botulinum type A toxin when nasally administered. 

b) //, Vivo Neutralization Of C botulinum Type A Neurotoxin 
By Anti- Recombinant C botulinum C Fragment Antibodies 

The ability of the anti-C. botulinum type A toxin antibodies generated by nasal 
administration of recombinant botulinal fusion proteins in rats ( Lxample 22) to neutralize < ' 
hotuhnun, type A toxin was tested in a mouse neutralization model. The mouse model is the 
ar, accepted method for detection of botulinal toxins in body fluids and for the evaluation of 
anti-hotulinal antibodies IE J. Schantz and DA, Kautter. .1. Assoc. Off. Anal. Chem. 6 1 :96 
(1990) and Investigational New Drug (BB-IND-3703) application by the Surgeon General of 
the Department of the Army to the Federal Food and Drug Administration |. The ami -C. 
huiulinum type A toxin antibodies were prepared as follows. 

Rats Horn the group given pMBot protein by nasal administration were boosted a 
second time with 250 ng pMBot protein per rat and serum was collected 7 days later. Serum 
from one rat from this group and from a preimmune rat was tested for anti-C botulinum type . 
A toxin neutralizing activity in the mouse neutralization model described below. 

The I.D,„ of a solution of purilled C. botulinum type A toxin complex, obtained from 
Dr. Hric Johnson (University of Wisconsin Madison), was determined using the intraperitoneal 
(IP) method of Schantz and Kautter [J. Assoc. Off. Anal. Chem. 61:96 <I978>| usmg , 8 -22 
gram female ICR mice and was found to be 3500 I.D.,/ml. The determination of the LD<„ was 
performed as follows. A Type A toxin standard was prepared by dissolving purilled type A 
toxin complex in 25 mM sodium phosphate buffer. P H 6.8 to yield a stock toxin solution of 
3.15 x IU 7 LD< ( /mg. The OD ?7 . s of the solution was determined and the concentration was 
adjusted to 10-20 ug/ml. The toxin solution was then diluted 1:100 in gel-phosphate (30 mM 
phosphate, pll 6.4: 0.2% gelatin). Further dilutions of the toxin solution were made as shown 
below in Table 39. Two mice were injected ll> with 0.5 ml of each dilution shown and the 
mice were observed for symptoms of botulism for a period of 72 hours. 
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TABLE 39 

Determination Of The LD.„ Of Purified C hotulimm, Type . A Toxin Complex 



Dilution 



1:320 



Number Dead At 72 hr 



2/2 



1:640 



1:1280 



1:2560 



0/2 (sick after 72 hr> 



1:5120 



0/2 (no symptoms) 



From the results shown in Table 39. the toxin titer was assumed to be between 2560 
LD„/ml and 5120 LD>I (or about 3840 LD^ml). This value was rounded to 3500 
LD v /ml for the sake of calculation. 

The amount of neutralizing antibodies present in the serum of rats immunized nasally 
with pMBoi protein was then determined. Serum from two rats boosted with pMBot protein 
as described above and preimmune serum from one rat was tested as follows. The toxin 
standard was diluted 1:100 in gel-phosphate to a final concentration of 350 \A)<„/m\. One 
milliliter of the diluted toxin standard was mixed with 25 ul of serum from each ol the three 
rats and 0.2 ml of gel-phosphate. The mixtures were incubated at room temperature lor 30 
min with occasional mixing. F.ach of two mice were injected with IP with 0.5 ml of the 
mixtures. The mice were observed for signs ol - botulism for 72 hr. Mice receiving serum 
from rats immunized with pMBot protein neutralized this challenge dose. Mice receiving 
preimmune rat serum died in less than 24 hr. 

The amount of neutralizing anti-toxin antibodies present in the scrum ol" rats 
immunized- with pMBot protein was then quamiiated. Serum antibody titrations were 
performed by mixing 0.1 ml of each of the antibody dilutions (see fable 40) with 0.1 ml of a 
1:10 dilution of stock toxin solution (3.5 x 10 J l.iyml.) with 1.0 ml of gel -phosphate and 
injecting 0.5 ml IP into 2 mice per dilution. The mice were then observed for signs of 
botulism for 3 days (72 hr). The results are tabulated in Table 39. 

As shown in fable 40 pMBot serum neutralized C. hotulimm, type A toxin complex 
when used at a dilution of 1:320 or less. A mean neutralizing value of 168 IU/ml was 
obtained for the pMBot serum (an IIJ is defined as 10.000 mouse LO Vl ). This value translates 
to a circulating serum titer of about 3;7 UJ/mg of serum protein. This neutralizing titer is 
comparable to the commercially available bottled concentrated (Connaught Laboratories. Ltd.) 
horse ami-C". htmlinum antiserum. A 10 ml vial of Connaught antiserum contains about 200 
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ihg/ml of proteimeach ml can neutralize 750 IU of C hotidinum type A toxin. After 
administration of one vial to a human, the circulating serum titer of the Connaucht 
preparation would be approximately 25 lU/ml assuming an average serum volume of 3 liters). 
1 hus. the circulating anti-C * hntulimtm titer seen in rats nasally immunized with pMBol 
protein ( 168 lU/ml) is 6.7 time higher than the necessary circulation titer of anti-C : hotulmum 
antibody needed to be protective in humans. 

TABLE 40 



Dilution 


pMBat* 


Rat 1 


Rat 2 


1:20 


2/2 


2/2 


1 :40 


2/2 


2/2 


'1:80 


2/2 


2*2 


1:160 


2*2 


2 '** 


1:320 


2/2* 


2'2 h 


1 :640 


0/2 


0'2 


1:1280 


0/2 


0/2 


1:2560 


0/2 


0/2 



Numbers represent the number of mice surviving at 72 hours which received serum taken from 
nus immunized with the pMDot protein. 

These mice survived but were sick after 72 hr. 

These results demonstrate that antibodies capable of neutralizing c". hot ulinum type A 
toxin are induced when recombinant C. hotulmum C fragment fusion protein produced in £. 
colt is used as an immunogen. 

EXAMPLE 24 

Production Of Soluble C* hotulinum C Fragment 
Protein Substantially Free Of Endotoxin Contamination 



Example 23 -demonstrated that neutralizing antibodies are generated by immunization 
with the pMBol protein expressed in E, colt. These results showed that the pMBot fusion 
protein is u good vaccine candidate. However, immunogens suitable for use as vaccines 
should be pyrogen- free in addition to having the capability of inducing neutralizing 
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antibodies. Expression clones and conditions Chat facilitate the production of C. hondinum C 
fragment protein for utililization as a vaccine were developed. 

The example involved: (a) determination of pyrogen content of the pMBot protein: 
<b) generation ol C. hondmum C fragment protein free of the MBP: fc) expression of C, 
hondinum C fragment protein using various expression vectors: and (d) purification of soluble 
( hondmum C fragment protein substantially free of significant endotoxin contamination. 

a) Determination Of The Pyrogen Content Of The pMBot 
Protein 

In order to use a recombinant antigen as a vaccine in humans or other animals, the 
antigen preparation must be show., to he free of pyrogens. The most s.gniflcant pyrogen 
present in preparations of recombinant proteins produced in gram-negative bacteria, such as £ 
coh. is endotoxin [F.C. Pearson. Pyrogens: cndo.oxins. LAL fesnng and c/epvro^uaion. 
(1985, Marcel Dekkcr. New York. pp. 23-56], To evaluate the utility of the pMBot protein 
as a vaccine candidate, the endotoxin content in MBP fusion proteins was determined. 

The endotoxin content of recombinant protein samples was assayed utilizing the 
l.imulus assay (LAL kit: Associates of Cape Cod ) according to the manufacturer s 
instructions. Samples of affinity-purified pMal-c protein and pM A 1870-2680 were found to 
contain high levels of endotoxin f>50.()00 F.U/mg protein: l-U (endotoxin unit)]. This 
suggested that MBP- or toxin A repeat-containing fusions with the botulinal C fragment 
should also contain high levels of endotoxin. Accordingly, remov al of endotoxin from 
affinity-purified pMal-c and pMBot protein preparations was attempted as follows. 

Samples of pMal-c and pMBot protein were depyrogenated with polymyxin to 
determine if the endotoxin could be easily removed. The following amount of protein was 
treated: 2" ml at 4.8 OD :8( /ml for pMal-c and 19 mis at 1 .44 OD, M /ml for pMBot. The 
protein samples were dialyzed extensively against PBS and mixed in a 50 ml tube (Falcon) 
with 0.5 ml PBS-equilibrated polymyxin B (Affl-Prep Polymyxin. BioRad). The samples 
were allowed to mix by rotating the tubes overnight at 4°C. The polymyxin was pelleted by 
centrifugation for 30 min in a bench top centrifuge at maximum speed (approximately 2000 x 
X) and the supernatant was removed. The recovered protein (in the supernatant) was 
quantified by OD :!t(l . and the endotoxin activity was assayed by I.AL. In both cases only 
approximately 1/3 of the input protein was recovered and the polymyxin-trcaied protein 
retained significant endotoxin contamination (approximately 7000 L-U/mg of pMBot). 
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The depyrogenation experiment was repeated using an independently purified pMal-c 
protein preparation and similar results were obtained. From these studies it was concluded 
that significant levels of endotoxin copurifies with these MBP fusion proteins using the 
am> lose resin. Furthermore, this endotoxin cannot be easily removed by polymyxin 
treatment. 

These results suggest that the presence of the MBP sequences on the fusion protein 
complicated the removal of endotoxin from preparations of the pMBol protein. 

b) Generation Of C. hotulinum C Fragment Protein Free Of 
The MBP 

It was demonstrated that the pMBot fusion protein could not be easily purified from 
contaminating endotoxin in section a) above. The ability to produce a pyrogen-free (cy.. 
endotoxin-frec) preparation of soluble botulinal C fragment protein free of the MBP tag was 
next investigated. The pMBot expression construct was designed to facilitate purification of 
the botulinal C fragment from the MBP tag by cleavage of the fusion protein by utilizing an 
engineered Factor Xa cleavage site present between the MBP and the botulinal C fragment. 
The Factor Xa cleavage was performed as follows. 

factor Xa (New England Biolabs) was added to the pMBot protein (using a 0.1-1.0% 
factor Xa/pMBot protein ratio) in a variety of buffer conditions [e.g.. PBS-NaCI (PBS 
containing 0.5 M NaCI). PBS-NaCI containing 0.2% Tween 20. PBS. PBS containing 0.2% 
Tween 20. PBS-C (PBS containing 2 mM CaCK). PBS-C containing either 0.1 or 0.5 % 
Tween 20. PBS-C containing either 0.1 or 0.5% NP-40. PBS-C containing either 0.1 or 0.5% 
friton X-100. PBS-C containing 0.1% sodium deoxycholaie. PBS-C containing 0.1% SDSJ. 
I he f actor Xa digestions were incubated for 12-72 hrs at room temperature. 

The extent of cleavage was assessed by Western blot or Coomassie blue staining of 
proteins following electrophoresis on denaturing SDS-PAGE gels, as described in Example 
22. Cleavage reactions (and control samples of unc leaved pMBot protein) were centrifuged 
lor 2 min in a inicrofuge to remove insoluble protein prior to loading the samples on the gel. 
The f actor Xa treated samples were compared with uncleavcd. uncentrifuged pMBot samples 
on the same gel. The results of this analysis is summarized below 

I ) Most (about 90%) pMBot protein could be removed by centrifugation. even 
when uncleavcd control samples were utilized. This indicated that the pMBot fusion protein 
was not fully soluble {i.e.. it exists as a suspension rather than as a solution). [This result was 
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consistent with the observation thai most affinity-purified pMBot protein precipitates after 
long term storage (>2 weeks) at 4°C. Additionally, the majority [i.e.. 75%) of induced 
pMBot protein remains in the pellet after sonication and clarification of the induced E. coli. 
^suspension of these insoluble pellets in PBS followed by sonication results in partial 
solubilization of the insoluble pMBot protein in the pellets. | 

2) The portion of pMBot protein that is fully in solution (about 10% of pMUot 
protein) is completely cleaved by Factor Xa. but the cleaved (released) botulinal C fragment is 
relatively insoluble such that only the cleaved MBP remains fully in solution. 

3) None of the above reaction conditions enhanced solubility without also 
reducing effective cleavage. Conditions that effectively solubilized the cleaved botulinal C 
fragment were not identified. 

4) The use of 0. 1% SDS in the buffer used for Factor Xa cleavage enhanced the 
solubility of the pMBot protein (all of pMBot protein was soluble). However, the presence of 
the SOS prevented any cleavage of the fusion protein with Factor Xa. 

1:1 r>) Analysis of pelleted protein from the cleavage reactions indicated that both full 

length pMBot </.i V . uncleaved) and cleaved botulinal C fragment protein precipitated during 
incubation. 

These results demonstrate that purification of soluble botulinal C fragment protein after 
cleavage of the pMBot fusion protein is complicated by the insolubility of both the pMBot 
20 protein and the cleaved botulinal C fragment protein. 

e) Expression Of C. botulinum C Fragment Using Various 
Expression Vectors 

In order to determine if the solubility of the botulinal C fragment was enhanced by 
25 expressing the C fragment protein as a native protein, an N-terminal His-lagged protein or as 
a fusion with glutathione-S-transferase (GST), alternative expression plasmids were 
constructed. These expression constructs were generated utilizing the methodologies described 
in Example 22. Figure 27 provides a schematic representation of the vectors described below. 
In Figure 27. the following abbreviations are used, pp refers to the pFT23 vector. 
30 pHIS refers to the pETHisa vector. pBlue refers to the pBluescript vector: pM refers to the 
pMAL-c vector and pG refers to the pGEX3T vector (described in Example 1 1 ). I he solid 
black lines represent C. hoiulinum C fragment gene sequences: the solid black ovals represent 
the MBP: the hatched ovals represent GST: "HHHHH" represents the poly-histidine tag. In 
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Figure 27. when the name for a restriction enzyme appears inside parenthesis, this indicates 
that the restriction site was destroyed during construction. An asterisk appearing with the 
, name lor a restriction enzyme indicates that this restriction site was recreated at a cloning 
junction. 

i) Construction Of pPBot 

In order to express the C. hoiulimtm C fragment as a native {i.e.. non-fused) protein, 
the pPttot plasmid (shown schematically in Figure 27) was constructed as follows, the C* 
fragment sequences present in pAlterBot (Example 22) were removed by digestion of 
pAlterBot with Nco\ and ffiniflU. The NcoVllindlU C fragment insert was ligatcd to 
pETIIisa vector (described in Example 1Kb)' which, was digested with Kcol and HindlH. This 
ligation creates an expression construct in which the Aval -encoded methionine of the botulinal 
C fragment is the initiator codon and directs expression of the native botulinal C fraumcnt. 
The ligation products were used to transform competent BL2I(DE3)pI.ysS cells (Novation). 
Recombinant clones were identified by restriction mapping. 

ii) Construction Of pHisBot 

In order to express the C* Botulinum C fragment containing a polv-histidinc lag at the 
amino-terminus of the recombinant protein, the pHisBot plasmid (shown schematically in 
Figure 27) was constructed as follows. The A'col/Him/lU botulinal C fragment insert from 
pAllerbot was ligatcd into the pETIIisa vector which was digested with AViel and UindXW. 
The Xcol (on the C fragment insert) and Nhe\ (on the pETHisa vector) sites were 11 lied in 
using the klenovv fragment prior to ligation; these sites were then blunt end ligated (the A</cl 
site was regenerated at the clone junction as predicted). The ligation products were used to 
transform competent BL2HDE3)pLysS cells and recombinant clones were identified by 
restriction mapping. 

The resulting pHisBot clone expresses the botulinal C fragment protein with a 
histidine-tagged N-terminal extension having the following sequence: MetGIyl lis! lis 
HisHisHisliisHisHisHisllisSer ( SEQ [D NO:24): the amino 

acids encoded by the botulinal C fragment gene arc underlined and the vector encoded amino 
acids are presented in plain type. The nucleotide sequence present in the pETIIisa vector 
which encodes the pHisBot fusion protein is listed in SEQ ID NO:25. The amino acid 
sequence of the pHisBot protein -is listed in SEQ ID NO:26. 
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Hi) Construction Of pGBot 

The botulinal C fragment protein was expressed as a fusion with the glutathione-S- 
transferase protein by constructing the pGBot plasmid (shown schematically in Fiuure ">7) 
This expression construct was created by cloning the AV„J/.SV,/J C fragment insert present in 
pBlueBo, (Example 22) into the P GEX3T vector which was digested with .S>„«, and Xhol 
The .Wl site (present on the botu.inal fragment, was made blunt prior to liuation using the 
Wenow fragment. The ligation products were used to transform competent BL2I cells. 

Each of the above expression constructs were tested by restriction digestion to confirm 
the integrity of the constructs. 

Large scale ( I liter) cultures of pPBot fBL2l(DE3)pLvsS host|. pHisBoi 
|BL21(DE3,pLysS host) and pGBot (BL2I host, were grown in 2XYT medium and induced 
(usmg .PTG to 0.8-1.0 mM) for 3 hrs as described in Example 22. Total, soluble and 
.nso.ub!e protein preparations were prepared from 1 ml aliuuots of each larue scale culture 
IWdliams e, „/. < .994,. .v,,,™, and analysed by SDS-PAGE. No obvious induced band w as 
detectable in the pPBot or pHisBot samples by Coomassie staininu. while a prominent 
insoluble band of the anticipated MW was detected in the pGBot sample. Soluble Ivsates of 
the pGBot large scale tresuspended in PBS) or plhsBot large scale (resuspended in Novagen 
IX bmdmg .buffer (5 mM imidazole. 0.5 M Nad. 20 n,M Tris-.lCI. pl, 7.9), eu.tures were 
prepared and used to affinity purify soluble affinity-tagged protein as follows. 

The P GBo, lysate was affinity purified on a glutathione-agarose resin (Pharmacia, 
exactly as described in Smith and Corcoran fCurrcnt Protocols in Molecular Biology 
Supplement 28 (.094). pp . .6.7.1- U,7.7|. The pHisBo, protein was purified on the Mis-Bind 
res,„ 4 Novagen , utilizing the His-bind buffer kit (Novagen, exactly as described by 
manufacturer. 

Samples from the purification of both the pGBot and pHisBot proteins (including 
uninduced. induced, total, soluble, and affinity-purified eluted protein) were resolved o.tSDS- 
PAGE gels. Following electrophoresis, proteins were analyzed by Coomassie s.aininu or by 
Western blot detection utilizing a chicken anti-C. W«,,«n I ypc A toxoid antibody ias 
described in Example 22). 

These studies showed that the pGBot protein was almost entirely insoluble under the 
utilized conditions, while the pHisBot protein was soluble. Affinity purification of the 
PHisBot protein on this first attempt was inefficient, both in terms of yield (most of the 
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immunoreactive botulinal protein did not bind to the His-bind resin) and purity (the botulinal 
protein was estimated to comprise approximately 20% of the total eluted protein). 

d) Purification Of Soluble C. botutinum C Fragment Protein 
Substantially Free Of Endotoxin Contamination 

The above studies showed that the pHisBot protein was expressed in £ tali as a 
soluble protein. However, the affinity purification of this protein on the His-bind resin was 
very inefficient. In order to improve the affinity purification of the soluble pHisBot protein 
(in terms of both yield and purity), an alternative poly-histidine binding affinity resin (Ni- 
NTA resin: Qiagen) was utilized. The Ni-NTA resin was reported to have a superior binding 
affinity (K,- I x 10 ,; at pH 8 0: Qiagen user manual) relative to the His-bind resin. 

•V soluble lysate (in Novagen IX binding buffer) from an induced I liter 2X YT 
culture was prepared a.s described above. Briefly, the culture of pHisBot fl3l2 l(Dn3)pLysS 
hosi| was grown at 37°C to an OD WK1 of 6.7 in I liter of 2X YT medium containing 100 
ug/ml umpiciltin. 34 ng/ml chloramphenicol and 0.2% glucose. Protein expression was 
induced by the addition of IPTG to 1 mM. Three hours after the addition ol the IPTG. the 
cells were cooled lor 15 min in a ice water bath and then centrifuged 10 min at 5000 rpm in 
a JAIO rotor (Ueckman) at 4°C. The pellets were resuspended in a total volume of 40 mis 
Novagen IX binding buffer (5 mM imidazole. 0.5 M NaCl. 20 mM Tris-HCl. pH 7.9). 
transferred to two 35 mi Oakridge tubes and frozen at -70°C (or at least I hr. The tubes 
were thawed and the cells were lysed by sonication (4 X 20 second bursts using a Branson 
Soniller 450 with a power setting of 6-7) on ice. The suspension was clarified by 
eenirifiigalion for 20 min at 9.000 rpm ( ! 0.000 x & in a JA-'I 7 rotor ( Beckman >. . 

The soluble lysate was brought to 0.1% NP40 and then was batch absorbed to 7 ml of 
a 1:1 slurry of Ni-NTA resin:binding buffer by stirring for I hr at 4°C. The slurry was 
poured into a column having an internal diameter of I or 2.5 cm (BioRad). The column was 
then washed sequentially with 15 mis of Novagen IX binding buffer containing 0.1% NP40. 
15 ml of Novagen IX binding buffer. 15 ml wash buffer (60 mM imidazole. 0.5 M NaCl 20 
mM Tris-HCl. pH 7.<» and 15 ml NaHPO, wash buffer (50 mM NaHPO,. pi I 7.0. ().3 M 
NaCl. 10 % glycerol). The bound protein was eluted by protonation of the resin using elution 
buffer (50 mM NaHPO,. pH 4.0. 0.3 M NaCl. 10 % glycerol). The eluted protein was stored 
at 4°C. 
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Samples of total, soluble and eluted protein were resolved by SDS-PAGK. Protein 
samples were prepared for electrophoresis as described in Hxample 22b. Duplicate gels were 
stained with Coornassie blue to visualize the resolved proteins and C. lioiulinum type A toxin- 
reactive protein was detected by Western blot analysis as described in lixample 22b. A 
representative Coornassie stained gel is shown in Figure 28. In Figure 28. the following 
samples were loaded on the 12.5% acrylamide gel. Lanes 1-4 contain respectively total 
protein, soluble protein, soluble protein present in the How-through of the Ni-NTA column 
and affinity-purified pHisBot protein (i.e.. protein released from the Ni-NTA resin by 
protonatipn). Lane 5 contains high molecular weight protein markers (BioRad). 

The purification of pHisBot protein resulted in a yield of 7 mg of affinity purified 
protein from a 1 liter starting culture of BL2l(DE3)pLysS cells harboring the pHisBot 
plasmid. The yield of purified pHisBot protein represented approximately 0.4% of the total 
soluble protein in the induced culture. Analysis of the purified pHisBot protein 'by SDS- 
IVtfiK. revealed that at least 90-95% of the protein was present as a single band (Figure 28) of 
the predicted MW (50 kD). This 50 kD protein band was immunoreactive with ami-C. 
htuulinutn .type. A toxin antibodies. The extinction coefficient of the protein preparation was 
determined to be 1.4 (using the Pierce BC A assay) or 1.45 (using the l.owry assay ) ()[),„„ per 
I mg/ml solution. 

Samples of pi I neutralized eluted pHisBot protein were resolved on a KB 803 HPLC 
column (Shodex). Although His-tagged proteins are retained by this sizing column l perhaps 
due to the inherent metal binding ability of the proteins), the relative mobility of the pHisBot 
protein was consistent With that expected for a non-aggregated protein in solution. Most of 
the induced pHisBot protein Was determined to be soluble under the growih and solubilization 
conditions utilized above (i.e.. greater than 90% of the pHisBot protein was found to be 
soluble as judged by comparison of the levels of pHisBot protein seen in total and soluble 
protein samples prepared from BL2 l(Dl£3)pLysS cells containing the pHisBot plasmid). 
SDS-PACiE analysis of samples obtained alter centrifugation. extended storage at -20°C. and 
at least 2 cycles of freezing and thawing detected no protein loss (due to precipitation), 
indicating that the pHisBot protein is soluble in the elution buffer [i.e.. 50 mM NallPO,. pH 
4.0. 0.3 M NaCl. 10 % glycerol). 

Determination of endotoxin contamination in the affinity purified pHisBot preparation 
(after pH neutralization) using the I.AL assay (Associates of Cape Cod) detected no 
significant endotoxin contamination. The assay was performed using the endpoinl 
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chromogenic method (withoutdiazo-coupling) according to the manufacturer" s instructions. 
This method can detect concentrations of endotoxin greater than or equal to 0 03 EU/ml (EU 
refers to endotoxin units). The LAL assay was run using 0.5 ml of a solution comprising 0.5 
mg pHisBot protein in 50 mM NaHPO,. pi f 7.0. 0.3 M NaCI. 10 % glycerol; 30-60 EU were 
detected in the 0.5 ml sample. Therefore, the affinity purified pHisBot preparation contains 
60-120 EU/mg of protein. FDA Guidelines for the administration of parenteral drugs require 
that a composition to be administered to a human contain less than 5 EU/kg body weight (The 
average human body weight is 70 kg: therefore up to 349 EU units can be delivered in a 
parental dose.). Because very small amount of protein are administered in a vaccine 
preparation (generally in the range of 10-500 ng of protein), administration of affinity 
purified pHisBot containing 60-120 EU/mg protein would result in delivery of only a small 
percentage -of the permissible endotoxin load. For example, administration of 10-500 |ig of 
purified pllisBot to a 70 kg human, where ihe protein preparation contains 60 EU/mg protein, 
results in the introduction of only 0.6 to 30 EU |/.c.. 0.2 to 8.6% of the maximum allowable 
endotoxin burden per parenteral dose (less than 5 EU/kg body weight)]. 

The above results demonstrate that endotoxin (LPS) does not copurify with the 
pHisBot protein using the above purification scheme. Preparations of recombinantlv produced 
pHisBot -protein containing lower levels of endotoxin (less than or equal to 2 EU/ mg 
recombinant protein) may be produced by washing the Ni-NTA column with wash buffer until 
the OD, sn returns to baseline levels (i.e.. until no more UV-absorbing material comes off of 
the column). 

The above results illustrate a method for the production and purification of soluble, 
botulinal C fragment -protein substantially free of endotoxin. 

EXAMPLE 25 

Optimization Of The Expression And Purification Of pHisBot Protein 

The results shown in Example 24d demonstrated that the pHisBot protein is an 
excellent candidate for use as a vaccine as it could be produced as a soluble protein in caii 
and could be purified free of pyrogen activity. In order to optimize the expression and 
purification of the pHisBot protein, a variety of growth and purification conditions were 
tested. 
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a) Growth Parameters 
i) Host Strains 

The influence of the host strain utilized upon the production of soluble pi lisBol 
protein was investigated. A large scale purification of pHisBot was performed Tas described 
in Example 24d above] using the BL2I(DE3) host (Novagen) rather than the 
BL2l(I)E3)pLysS host. The deletion of the pLysS piasmid in the BL21(DE3) host yielded 
higher levels of expression due to de-repression of the plasmids T7-lac promoter. However, 
the yield of affinity-purified soluble recombinant protein was very low (approximately 600 
My/ liter culture) when purified under conditions identical to those described in Example 24d 
above. This result was due to the fact that expression in the BL2HDE3) host yielded very 
high level expression of the pllisBot protein as insoluble inclusion bodies as shown by SDS- 
I'AGE analysis of protein prepared from induced BL2 1 (DE3) cultures (Figure 20. lanes 1-7. 
described below). These results demonstrate that the pllisBot protein is not inherently toxic 
to E. voii ceils and can be expressed to high levels using the appropriate promoter/host 
combination. 

ligure 29 shows a Coomassie blue stained SDS-PAGE gel (12.5% acrylamide) onto 
which extracts prepared from BL21(DE3) cells containing the pHisBot piasmid were loaded. 
Each lane was loaded with 2.5 ul protein sample mixed with 2.5 ul of 2X SDS sample buffer. 
The samples were handled as described in Example 22b. The following samples were applied 
to the gel. Lanes 1-7 contain protein isolated from the BL2I(DE3) host. Lanes 8-14 contain 
proteins isolated from the BL2l(DE3)pLysS host. Total protein was loaded in lanes I. 2. 4. 
6. 8. 10 and 12. Soluble protein was loaded in Lanes 3. 5. 7. 9. 1 1 a „d 13. Lane I contains 
protein from uninduced host cells. Lanes 2-13 contain protein from host cells induced for 3 
hours. IPTO was added to a final concentration of 0. 1 mM (Lanes 6-7). 0.3 mM (Lanes 4-5) 
or 1.0 mM (Lanes 2. 3. 8-13). The cultures were grown in LB broth (Lanes X-9). 2X YT 
broth (Lanes 1 0-1 1) or terrific broth (Lanes 1-7. 12-13). The pllisBot protein seen in Lanes 
3. 5 and 7 is insoluble protein which spilled over from Lanes 2. 4 and 6. respectively. I ligh 
molecular weight protein markers (BioRad) were loaded in Lane 14. 

A variety Of expression conditions were tested to determine if the B1.2KDE3) host 
could be utilized to express soluble pHisBot protein at suitably high levels [i.e.. about 10 
mg/mh. The conditions altered were temperature (growth at 37 or 30°C). culture medium 
<2X YT. LB or Terrific broth) and inducer levels (0.1. 0.3 or 1.0 mM IPTG). All 
combinations of these variables were tested and the induction levels and solubility was then 
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assessed by SDS-PAGE analysis of total and soluble extracts [prepared from 1 ml samples as 
described in Williams ei at.. (1994). supra). 

All cultures were grown in 15 ml tubes (Falcon #2057). All culture medium was 
prewarmed overnight at the appropriate temperature and were supplemented with 100 M g/ml 
ampicillin and 0.2% glucose. Terrific broth contains 12 g/1 hacto-tryptone. 24 g/| bacto-ycast 
extract and 100 ml/I of a solution comprising 0.17 M KH : POv 0.72 M KIIPO,. C ultures 
were grown in a incubator on a rotating wheel (to ensure aeration) to an OI) NK1 of 
approximately 0.4. and induced by the addition of IPTG. In all cases, high level expression 
of insoluble pHisBot protein was observed, regardless of temperature, medium or inducer 
concentration. 

The elTcct of varying the concentration of IPTG upon 2X YT cultures grown at 23°C 
. was then investigated. IPTG was added to a final concentration of either 1 mM . 0. 1 m M 
0.05 mM or 0 .01 mM . At this temperature-, similar levels of P His Hot protein was induced in 
the presence of either I or 0. 1 mM IPTG: these levels of expression was lower than that 
observed at higher temperatures. Induced protein levels were reduced at 0.05 mM IPTG and 
absent at 0.01 mM IPTG (relative to 1.0 and 0.1 mM IPTG inductions at 23°C). However. 
n<> conditions were observed in which the induced pHisBot protein was soluble in this host. 
Thus, although expression levels are superior in the BL2KDF.3) host (as compared to the 
»L2l<DC3>pL.ysS host), conditions that facilitate the production of soluble protein in this host 
could not be identified: 

These results demonstrate that production of soluble pHisBot protein was achieved 
using the BL2 1 ( 15E3 )pl ysS host in conjunction with the T7-| at promoter. 

ii) Effect Of Varying Temperature, Medium And 
IPTG Concentration And Length Of Induction 

The effect growing the host cells in various mediums upon the expression of 
recombinant hotulinal protein from the pHisBot expression construct [in the BL2 1(1)1-3 jphysS 
host| was investigated. BL2l(DF.3)pLysS cells contatning the pHisBot plasmid were grown 
"i either LB. 2X YT or Terrific broth at 37X. The cells were induced using I m M IPTG for 
•a 3 hr induction period. Expression of pHisBot protein was found to be the highest when the 
cells were grown in 2X YT broth (see Figure 29. lanes 8-13). 

The cells were then grown at 30°C in 2X YT broth and the concentration of IPTG was 
varied from 1.0. 0.3 or 0.1 mM and the length of induction was cither 3 or 5 hours. 
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Expression of pHisBot protein was similar at all 3 inducer concentrations utilized and the 
levels of induced protein were higher after a 5 hr induction as compared to a 3 hr induction. 

Using the conditions found to be optimal for the expression of pHisBot protein, a large 
scale culture was grown in order to provide sufficient material for a large scale purification of 
the pHisBot protein. Three 1 liter cultures were grown in 2X YT medium containing 100 
Mg/ml ampicillin. 34 |Ag/ml chloramphenicol and 0.2% glucose. The cultures were grown at 
30°C and were induced with 1.0 mMIPTG for a 5 hr period. The cultures were harvested 
and a soluble lysatc were prepared as described in Example IX A large scale purification 
was performed as described in Example 24d with the exception that except the soluble lysate 
was batch absorbed for 3 hours rather than for I hour. I he final yield was 13 mg pHisBot 
protein/liter culture. The pHisBot protein represented 0.75% of the total soluble protein. 

The above results demonstrate growth conditions under which soluble p! I isBol protein 
is produced [i.e.. use of the BL2l(DE3)pLysS host. 2X YT medium. 30°C\ 1.0 mM IPTG for 
5 hours). 



h) Optimization Of Purification Parameters 

l or optimization of purification conditions, large scale cultures (3X1 liter) were 
grown at 30°C and induced with I mM IPTG for 5 hours as described above. The cultures 
were pooled, distributed to centrifuge bottles, cooled and pelleted as described in Example 
20 24d, The cell pellets were frozen at -70°C until used. Each cell pellet represented 1/3 of a 
liter starting culture and individual bottles were utilized for each optimization experiment 
described below. This standardized the input bacteria used lor each experiment, such that the 
yields of affinity purified pllisBot protein could be compared between different optimization 
experiments. 



i) Binding Specificity (pll Protonation) 

A lysate of pHisBot culture was prepared in PBS (pll 8.0> and applied to a 3 ml Ni- 
NTA column equilibrated in PBS (pH 8.0) using a How rate of 0.2 ml/min { 3-4 column 
voiumes/hr) using an Econo chromatography system (BioKad). The column was washed with 
PBS (pl l 8.0) until the absorbance (Ol) :xn ) of the elutc was at baseline levels. The How rate 
was then increased to 2 ml/min and the column was equilibrated in PBS (pll 7.0). A pll 
gradient (pll 7.0 to 4.0 in PBS) was applied in order to clute the bound pHisBot protein from 
the column, fractions were collected and aliquots were resolved on SDS-PAGH gels. The 
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PAGE gels were subjected to Western blotting and the pHisBot protein was detected using a 
chicken anti-C. hotulinum Type A toxoid antibody as described in Example 22. 

From the Western blot analysis it was determined that the pHisBot protein begins to 
elute from the Ni-NTA column at pH 6.0. This is consistent with the predicted elution of a 
Mis-lagged protein monomer at pH 5.9. 

These results demonstrate that the pH at which the pHisBot protein is protqnated 
(released) from Ni-NTA resin in PBS buffer is pH 6.0. 

ii) Binding Specificity (Imidazole Competition) 

In order to define purification conditions under which the native £ «,// proteins could 
be removed from the Ni-NTA column while leaving the pHisBot protein bound to the 
column, the following experiment was performed. A lysate of pHisBot culture was prepared 
in 50 n,M NaHPO,. 0.5 M NaCI. 8 n,M imidazole ( P H 7.0). This lysate was applied to a .1 
ml Ni-NTA column equilibrated in 50 mM NaHPO,. 0.5 M NaCI (pH 7.0) using an Econo 
chromatography system (BioRad). A How rate of 0.2 ml/min (3-4 column volumes/hr) was 
utilized. .The column was washed with 50 mM NaHPO,. 0.5 M NaCI (pH 7.0) until the 
absorbance of the elute returned to baseline. The How rale was then increased to 2 ml/min. 

Ihe column was elutcd using an imidazole step gradient (in 50 mM NaHPO,. 0.5 M 
NaCI- (pi I 7.0)|. F.lution steps were 20 mM. 40 mM. 60 mM. 80 mM. 100 mM. 200 mM. 1.0 
M imidazole, followed by a wash using 0.1 mM EDTA (to strip the nickel' from the column 
and remov e any remaining protein). In each step, the wash was continued until the Of),,,, 
returned to baseline. Tractions were resolved on SDS-PAGF. gels. Western blotted, and 
pHisBot protein detected using a chicken anti-C hmulinwv Type A toxoid antibody as 
described in Example 22. Duplicate gels were stained with Coomassie blue to detect elutcd 
protein in each fraction. 

The results of the PACE analysis showed that most of the non-specifically binding 
bacterial protein was removed by the 20 mM imidiazole wash, with the remaining bacterial 
proteins being removed in the 40 and 60 mM imidazole washes. I he pHisBot protein began 
to elute at 100 mM imidazole and was quantitatively eluted in 200 mM imidazole. 

These results precisely defined the window of imidazole wash stringency that 
optimally removes £. coli proteins from the column while specifically retaining the pHisBot 
protein in this buffer. These results provided conditions under which the pHisBot protein can 
be purified free ol contaminating host proteins. 
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Hi) Purification Buffers And Optimized 
Purification Protocols 

A variety of purification parameters were tested during the development of an 
optimized protocol for batch purification of soluble pHisBot protein. The results or these 
analyses arc summarized below. 

Batch purifications were performed (as described in Lxample 24d) using several 
buffers to determine if alternative buffers could be utilized for binding of the pHisBot protein 
to the Ni-NTA column. It was determined that quantitative bindmg of pHisBot protein to the 
Ni-NTA resin was achieved in either Tris-HCI (pH 7.9) or NaHPO, (pi I 8.0) buffers. 
Binding of the pHisBot protein in NaHPO, buffer was not inhibited using 5 mM. 8 inM or 60 
mM imidazole. Quantitative clution of bound pHisBot protein was obtained in buffers 
containing 50 mM NallPO,. 0.3 M NaCI (pH 3.5-4.0). with or without 10% glycerol. 
However, quantitation of soluble affinity purified pHisBot protein before and after a freeze 
thaw (following several weeks storage of the affinity purified elute at -2()°C> revealed that 
"4% of the protein was recovered using the glycerol-containing buffer, but only 68% of the 
protein was recovered when the buffer lacking glycerol was employed. This demonstrates 
thai glycerol enhanced the solubility of the pHisBot protein in this low pU buffer when the 
eluted protein was stored at freezing temperatures -20°C). Neutralization of pH by 

addition of NaH,POj buffer did not result in obvious protein precipitation. 

It was determined that quantitative binding of pHisBot protein using the batch format 
occurred alter 3 hrs (Figure 30). but not after 1 hr of binding at 4°C (the resin xvas stirred 
during binding). Figure 30 depicts a Coomaissc blue stained SDS-PACii; gel < 7.5% 
acrylamide) containing samples of proteins isolated during the purification of pHisBot protein 
from lysatc prepared from the BL21(DF3)pLysS host. Hach lane was loaded with 5 M l of 
protein sample mixed with 5 ul of 2X sample buffer and processed as described in Example 
22b. Lane I contains high molecular weight protein markers (BioRad). l anes 2 and 3 
contain proiein eluted from the Ni-NTA resin. Lane 4 contains soluble proiein alter a 3 hr 
batch incubation with the Ni-NTA resin. Lanes 5 and 6 contain soluble and total protein, 
respectively. Figure 30 demonstrates that the pHisBot proiein is completely soluble Icompare 
Lanes 5 and 6 which show that a similar amount of the 50 kD pHisBot protein is seen in 
both: if a substantial amount (greater than 20%) of the pHisBot proiein were partially 
insoluble in the host cell, more pHisBot protein would be seen in lane 6 (total protein) as 
compared to lane 5 (soluble protein) |. Figure 30 also demonstrates that the pHisBot proiein is 
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completely removed from the lysate after batch absorption with the Ni-NTA resin for 3 hours 
(compare Lanes 4 and 5). 

1 he reported high affinity interaction of the Ni-NTA resin with His-tagged proteins 
(K d = I x 10 ' at pH 8.0) suggested that it should be possible to manipulate the resin-protein 
complexes without significant release of the bound protein. Indeed, it was determined that 
after the recombinant protein was bound to the Ni-NTA resin, the resin-pHisBol protein 
complex was highly stable and remained bound following repeated rounds of centrifugation of 
the resin for 2 min at 1600 x When this centrifugation step was performed in a 50 ml tube 
(Falcon), a tight resin pellet formed. This allowed the removal of spent soluble lysate by 
pouring off the supernatant followedJ>y resuspension of the pellet in wash buffer. Further 
washes can be performed by centrifugation. The ability to perform additional washes permits 
the development of protocols for batch absorption of large volumes of lysate with removal of 
the lysate being performed simply by centrifugation following binding of the recombinant 
protein to the resin. 

A simplified, integrated purification protocol was developed as follows. A soluble 
lysate was made by rcsuspending the induced cell pellet in binding buffer [50 mM NaHP0 4 . 
0.5 M NaCI. 60 mM imidazole (pH X.0)|. sonicating 4 x 20 sec and centrifuging for 20 min 
at 10.000 x K . NP-40 was added to 0.1% and Ni-NTA resin (equilibrated in binding buffer) 
was added, liight milliliters of a 1:1 slurry (resjn.binding buffer) was used per liter of 
starting culture. The mixture was stirred for 3 hrs at 4°C. The slurry was poured into a 
column having a I cm internal diameter (BioRad). washed with binding buffer containing 
0.1% NP40. then binding buffer until baseline was established (these steps may alternatively 
be performed by centrifugation of- the resin, resuspension in binding buffer containing NIMO 
followed by centrifugation and resuspension in binding buffer). Imidazole was removed by 
washing the resin with 50 mM NaHPO,, 0.3M NaCI (pH 7.0). Protein bound to the resin was 
elutcd using the same buffer (50 mM NaHPO.,. 0.3M NaCI) having a reduced pH (pH 3.5- 
4.0). 

A pilot purification was performed following this protocol and yielded 18 mg/litcr 
affinity-purified pMisBot. The pHisBot protein was greater than *>()% pure as estimated by 
Coomassie staining of an SDS-PACil: gel. This represents the highest observed yield of 
soluble affinity-purified pHisBot protein and this protocol eliminates the need for separate 
imidazolc-comaining binding and wash buffers. In addition to providing a simplified and 
efficient protocol for the affinity purification of recombinant pHisBot protein, the above 
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results provide a variety of purification conditions under which pHisBot protein can be 
isolated. 



EXAMPLE 26 

The pHisBot Protein Is All Effective Immunogen 

In Example 23 it was demonstrated that neutrality antibodies arc generated in mouse 
serum alter nasal immunization with the pMBot protein. However, the pMBot protein was 
found to copurify with significant amounts of endotoxin which could not be easilv removed 
The pHisBot protein, in contrast, could be isolated free of significant endotoxin contamination 
making pH.sBot a superior candidate for vaccine production. To further assess the suitability 
of PHisBot as a vaccine, the immunogenic^ of the pHisBot protein was determined and a ' 
comparison of the relative immunogenic^ of pMBot and pHisBo, proteins in mice was 
performed as follows. 

Two groups of eight BALBc mice were immunized with either pMBot protein or 
pHisBot protein using Cierbu CiMDP adjuvant (CC Biotech). pMBo, protein (in PBS 
containing 10 mM maltose) or pHisBot protein (in 50 mMNaHPO,. 0.3 M Nad. 10% 
glycerol. P l| 4.0) was mixed with (ierbu adjuvant and used to immunize mice, Each mouse 
received an IP injection of 100 M l antigen/adjuvant mix (50 M g antigen plus I M g adjuvant) on 
day 0 Mice were boosted as described above with the exception that the route of 
administration was IM on day 14 and 28. The mice were bled on day 77 and anti-C 
haml.nun, Type A toxoid titers were determined using serum collected from individual mice 
in each group (as described in Example 23). The results are shown in Table 41 
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TABI*: 41 
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The results shown above in Table 4 1 demonstrate that both the pMBot and pHisBot 
proteins are immunogenic in mice as 100% of the mice (8/8) in each group seroconvcrted 
Irom non-immune to immune status. The results also show that the average titer of anti-CV 
haiutimtm Type A toxoid lgG is 2-.1 fold higher after immunization with the pHisBot protein 
relative to immunization with the pMBot protein. This suggests that the pHisBot protein may 
be a superior immunogen to the pMBot protein. 

EXAMPLE 27 

Immunization With The Recombinant 
pHisBot Protein Generates Neutralizing Antibodies 



The results shown in Example 26 demonstrated that both the pllisBol and pMBot 
proteins were capable of inducing high liters of anti-C. hmutinnm type A toxoid-reactive 
antibodies in immunized hosts. The ability of the immune sera from mice immunized with 
either the pHisBot or pMBot proteins to neutralize ( ' hotulimmi type A toxoid in vivo was 
determined using the mouse neutralization assay described in Example 23b. 
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The two groups of eight BALBc mice immunized vviih either pMBot pro , e i n or 
pHisBot protein in Example 26 were boosted again one week after the bleeding on dav 77. 
The boost was performed by mixing pMBoi protein (in PBS containing 10 mM maltose) or 
pHisBot protein (in 50 mM NaHPO,. 0.3 M NaCI. 10% glycerol. pH 4.0) with Gerbu 
adjuvant as described in Example 26. Each mouse received an IP injection of 100 ul 
antigen/adjuvant mix (50 ug antigen plus 1 ug adjuvant) The mice were bled 6 davs after 
lh« boost and the serum from mice within a group was pooled. Serum from preimmune mice 
was also collected (this serum is the same serum described in the footnote to Table 41). 

The presence of neutralizing antibodies in the pooled or preimmune scrum was 
detected by challenging mice with 5 LD 5 „ units of type A toxin mixed with 100 ul of pooled 
serum. The challenge was performed by mixing (per mouse to be injected) 100 ul of serum 
from each pool with 100 ul of purified type A toxin standard (50 l.D,„ /ml prepared as 
described in Example 23b, and 500 ul of gel-phosphate. The mixtures were incubated for 30 
min at room temperature with occasional mixing. Each of four mice were injected IP with 
the mixtures (0.7 ml/mouse). The mice were observed tor signs of botulism for 72 hours. 
Mice receiving toxin mixed with serum from mice immunized with either the pHisBot or 
pMBot proteins showed no signs of botulism intoxication. In contrast, mice receiving 
preimmune serum died in less than 24 hours. 

These results demonstrate that antibodies capable of neutralizing f htndmum type A 
toxin are induced when either of the recombinant C. hotulmum C fragment proteins pHisBot 
or pMBot are used as immunogens. 

EXAMPLE 28 

Cloning And Expression Of The C Fragment of ( . hotulmum 
Serotype A Toxin In E. cull Utilizing A Native Gene fragment 

In Example 22 above, a synthetic gene was used to express the C fragment of C. 
hmuHnum serotype A toxin in £ „,//. The synthetic gene replaced non-preferred (/.,.. rare) 
codons present in the C fragment gene with codons which are preferred by /•. <•„//. The 
synthetic gene was generated because it was been reported that genes which have a high A/T 
content (such as most clostridial genes) creates expression difficulties in E. coli and yeast. 
Furlhermore. LaPenoticre vl «/. suggested that problems encountered with the stability (non- 
fusion constructs) and solubility (MBP fusion constructs) of the C fragment of C hondinum 
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serotype A toxin when expressed in £. cW, was most likely due to the extreme A/T richness 
of the native C. bmulinum serotype A toxin gene sequences (LaPenotiere. a at., supra). 

In this example, it was demonstrated that successful expression of the C fragment of 
C hun.linun, type A toxin gene in E. ,nli does not require , he elimination of rare codons 
</>.. there is no need to use a synthetic gene). This example involved a) the clonine of the 
native C fragment of the C. W/,„, w serotype A toxin gene and construction of an^ 
expression vector and b) a comparison of the expression and purification yields of C 
honU.nun, serotype A C fragments derived from native and synthetic expression vectors. 

a) Cloning Of The Native C Fragment Of The C. botutinum 

Serotype A Toxin Gene And Construction Of An Expression 
Vector 

The serotype A toxin gene was cloned from C. botulhmm genomic DNA using I'CR 
amplillcation. The following primer pair was employed: 5-CG CCATGC, rTAC. 
ATTATTATCTACATTTAC-3- (5" primer. Ncol site underlined: SEQ ID NO:29) and 
^ -GC'Aj/VGCTTCTTCiACAGACTCA I'(jTAG-3* (.V primer. Hindlll site underlined: SEQ ID 
N<>:3<>). < W/ih™ type A strain was obtained from the American Type C ulture Collection 
< A rCX-ri V97) and grown under anaerobic conditions in Terrific broth medium. Hid, 
molecular-weight C : hondinum DNA was isolated as described in Example 1 1 . The integrity 
and yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda 
DNA alter electrophoresis on an agarose gel. 

The gene fragment was cloned by PCR utilizing a proofreading thermostable DNA 
polymerase (native />,„ polymerase). PCR amplification was performed using the above 
primer pair in a 50>l reaction containing lOmM Tris-HCI (pH 8.3). 50mM KCI. l.jmM 
MgCk. 200/cM each dNTP. 0.2^M each primer, and 50ng C. h.uulinum genomic DNA. 
Reactions were overlaid with 100>l mineral oil. heated to 94"C 4 min. 0.5^x1 native /'/„ 
polymerase (Stratagene) was added, and thirty cycles comprising 94'C for I min. 50'C for 2 
min. 72-C for 2 min were carried out followed by 10 min at 72'C. An aliquot (10^1) of the 
reaction mixture was resolved on an agarose gel and the amplified native C fragment uene 
was gel purified using the Prep-A-Ciene kit (BioRad) and I i gated to pCRScrip, vector DNA 
(Stratagene). Recombinant clones were isolated and confirmed by restriction digestion, using 
standard recombinant molecular biology techniques |Sambrook cl at. (1989). supra). In 
addition, the sequence of approximately 300 bases located at the 5" end of the C fragment 
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coding region were obtained using standard DN A sequencing methods The sequence 
obtained was identical to that of the published sequence. 

An expression vector containing the native C. botulinum serotype A C fragment gene 
was created by ligation of the Nco\-Hind\\\ fragment containing the C fragment gene from the 
pCRScript clone lo Nhe\-Hmd\\\ restricted pETHisa vector (Example 18b). The Ncol and 
She\ sites were lllled in using the Klenow enzyme prior to ligation: these sites were thus 
blunt-end ligated together, t he resulting construct was termed pHisBotA (native). pHisBotA 
(native) expresses the C botulinum serotype A C fragment with a his-tagged N terminal 
extension which has the following sequence: 

MetGlyllisHisIIisHisIIisHisHi^^ ID 
N():24). where the underlining represents amino acids encoded by the C botulinum C 
fragment yene (this N terminal extension contains the recognition site foi I actorXa protease, 
shown in italics, which can be employed to removed the polyhistdine tract from the N- 
tcrminus of the fusion protein). The pHisBot (native) construct expresses the identical protein 
as the pI lisBot construct (Ex. 24c: herein after the pHisBotA) which contains the synthetic 
gene. 

The predicted DNA sequence encoding the native C botulinum serotype A C fragment 
gene contained within pHisBotA (native) is listed in SlfQ M) NO:3l |ihe start o! translation 
(ATCi) is located at nucleotides 108-1 10 and the stop of translation (TAA) is located at 
nuclcotidesl404-14<)6 in SEQ ID N():3l] and the corresponding amino acid sequence is listed 
in SEQ ID NO:26 {i.e.. the same amino acid sequence as that produced hy pHisBotA 
containing synthetic gene sequences). 

b) Comparison Of The Expression And Purification Yields Of 
C botulinum Serotype A C Fragments Derived From Native 
And Synthetic Expression Vectors 

Recombinant piasmids containing cither the native or the synthetic C. botulinum 
serotype A C fragment genes were transformed into coli strain BI2l(DE3) pLysS and 
protein expression was induced in 1 liter shaker Husk cultures. Total protein extracts were 
isolated, resolved on SDS-IWGE gels and C botulinum C fragment protein was identified by 
Western analysis utilizing a chicken anti-C". botulinum serotype A toxoid antiserum as 
described in Example 22. 
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Briefly, I liter (2XYT + 100 ^ig/ml ampicillin and 34 pg/mi chloramphenicol) cultures 
of bacteria harboring either the pHisBotA (synthetic) or pHisBolA (native) plasmids in the 
BI2KDE3) pLysS strain were induced to express recombinant protein by addition of IPTG to 
ImM. C ultures were grown at 30-3 2°C. IPTG was added when the cell density reached an 
OD W)(1 0.5-1.0 and the induced protein was allowed to accumulate for 3-4 hrs after induction. 

The cells were cooled for 15 min in a ice water bath and then centrifuged for 10 min 
at 5000 rpm in a JAI0 rotor (Beckman) at 4°C. The cell pellets were resuspended in a total 
volume of 40 mis IX binding buffer (40 mM imidazole. 0.5 M NaCL 50 mM NaPO,. pH 
X.0). transferred to two 50 ml Oakridge tubes and frozen at -70°C for at least 1 hr. The tubes 
were then thawed and the cells were lyscd by sonication (using four successive 20 second 
bursts) on ice. The suspension was clarified by centrifugation 20-30 min at 9.000 rpm 
<(I0.000.tr) in a J A- 17 rotor. The soluble I y sate was batch absorbed to 7 ml of a 1:1 slurry of 
NiNTA resin:binding buffer by stirring 2-4 hr at 4°C The slurry was centrifuged for 1 min 
at 500<; in 50 ml tube (Falcon*, resuspended in 5 mis binding buffer and poured into a 2.5 cm 
diameter column- (BioRad). The column was attached to a UV monitor (ISCO) and the 
column was washed with binding buffer until a baseline was established. Imidazole was 
removed by washing with 50mM NaPC) 4 . 0.3 M NaCI. 10% glycerol. pH 7.0 and bound 
protein was eluted using 50mM NaPO^. 0.3 M NaCI. 10% glycerol. pH 3.5-4.0. 

The eluted proteins were stored at 4°C. Samples of total, soluble, and eluted proteins 
were resolved by SDS-PAGE. Protein samples were prepared for electrophoresis by mixing 
Ifil total (T) or soluble (S) protein with 4 p| PBS and 5 j.il 2X SDS-PAGE sample buffer, or 
5 pi eluted <E) protein and 5 pi 2X SDS-PAGE sample buffer. The samples were heated to 
95"C for 5 min. then cooled and 5 or 10 pels were loaded on 12.5% SDS-PAGE gels. Broad 
range molecular weight protein markers (BioRad) were also loaded to allow the MW of Un- 
identified fusion proteins to be estimated. After electrophoresis, protein was detected either 
generally by staining gels with Coomassie blue, or specifically, by blotting to nitrocellulose 
ior Western blot detection of specific immunoreactive protein. 

1 or Western blot analysis, the gels were blotted, and protein transfer was confirmed by 
Ponceau S staining as described in Example 22. After blocking the blots for I hr at room 
temperature in blocking buffer (PBST and 5% milk). 10 ml of a 1/500 dilution of an anli-C 
hondhutm toxin A IgY PEG prep (Ex. 3) in blocking buffer was added and the blots were 
incubated for an additional hour at room temperature. The blots were washed and developed 
using a rabbit anti-chicken alkaline phosphatase conjugate (Boehringer Mannheim) as the 
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secondary antibody as described in Ex.22. This anajysis detected C Mwkw toxin A . 
reacuve proteins in the piljsBotA (native and synthetic) protein samples (corresponding to the 
predicted full length proteins identified by Coomassie staining). i 

A gel containing proteins expressed from the pHisBot and pHisBot (native) constructs 
dunng various stages of purification and stained with Coomassie blue is shown in Figure 3/ 
In Hgure 31. lanes 1-4 and 9 contain proteins expressed by the pHisBotA construct (/c the 
synthet.c gene, and lanes 5-8 contain proteins expressed by the pHisBotA (native, construct 
Lanes , and 5 contain total protein extracts; lanes 2 and 6 contain soluble protein extracts- 
lanes 3 and 7 contain proteins which flowed through the NiNTA columns: lanes 4 8 and * 
contam protein eluted from the NiNTA columns and lane .0 co „, ains moIecular w • h| . 
markers. 

The above purification resulted in a yield of 3 mg (native gene, or 1 . mc (svmhclic 
l?ene, of affinity purified protein from a . liter starting culture, of which at least^O^ of 
the prote.n was a single band of the predicted MW (50kd> and immunoreactivitv for 
recombinant ( W/,„ W serotype A C fragment protein. Other than the level of expression 
»» Terence w as observed between the native and the synthetic gene expression svstcms 

Ihese results demonstrate that soluble C. hotuUnum serotype A C fragment protein can 
be expressed in E. coli and purified utilizing either native or synthetic gene sequences. 

EXAMPLE 29 

Generation Of Neutralizing Antibodies Using A Recombinant 
C • hnndinum Serotype A C Fragment Protein Containing A Six Residue I lis- 1 ag 

In Lxamplc 27. neutralizing antibodies were generated utilizing the pMisBolA protein 
wh,ch contains a histidine-tagged N-tcrminal extension comprising 10 histidine residues To 
determmc ,f the generation of neutralizing antibodies is dependent on the presence of this 
particular his-tag. a protem containing a shorter N-termina. extension (comprising 6 histidine 
res.dues, was produced and tested for the ability to generate neutralizing antibodies This 
example involved a) the cloning and expression of the P 6HisBoiA(syn> protein and b) the 
generation and characterization of hyperimmune scrum. 
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a) Cloning And Expression Of The p6HisBotA(syn) Protein 

The p6HisBotA(syn) construct was generated as described below; the term "syn" 
. designates the presence of synthetic gene sequences. This construct expresses the C frgament 
of the C. hotulimim serotype A toxin with a histidine-tagged N terminal extension having the 
following sequence: MetHisHisHisHisHisHis MetAla (SEQ ID NO:32): the amino acids " 
encoded by the botulinal C fragment gene are underlined and the vector encoded amino acids 
are presented in plain type. 

6XHis oligonucleotides [5"-TATGCATCACCATCACCATCA-3" (SEQ ID NO:33) and 
5 -CATGTGATGGTGATGGTGATGCA-3- (SEQ ID NO:34) were annealed as follows. One 
microgram of each oligonucleotide was mixed in total of 20 ul IX reaction buffer 2 (NEB) 
and the mixture was heated at 70°C for 5 min and then incubated at 42°C for 5 mi n . The 
annealed oligonucleotides were then ligated with gel purified NtleVHirtiXU cleaved pET23b 
0 7 promoter! or pET21b (T7lac promoter) DNA and the gel purified NvoVHin&W C 
hniulimmi serotype A C fragment synthetic gene fragment derived from pAltcrBot (Ex. 22). 
Recombinant clones were isolated and confirmed by restriction digestion. The DNA sequence 
encoding the 6X his-tagged BotA protein contained within P 6HisBotA(syri) is listed in SEQ 
ID NO:35. The amino acid sequence of the p6XHisBotA protein is listed in SEQ ID NO:36. 

The resulting recombinant n6XMisBotA plasmid was transformed into the BI.2KDE3) 
pLysS strain, and I liter cultures were grown, induced and harvested as described in Example 
28. His-tagged protein was purified as described in Example 28. with the following 
modifications. The binding buffer (BB) contained 5 mM imidazole rather than 40 mM 
imidazole and NP40 was added to the soluble lysate to a final concentration of 0. 1 %. The 
bound material was washed on the column with BB until the baseline was established, then 
the column was washed successively with BB+20 mM imidazole and BB+40 mM imidazole. 
The column was eluted as described in Example 28. 

In the case of the pLT23-derived expression system, high level expression of insoluble 
OHisBoiA protein was induced. The pET2 1 -derived vector expressed lower levels of soluble 
protein that bound the NiNTA resin and eluted in the 40 mM imidazole wash rather than 
during the low p|| elution. These results [i.e.. low level expression of a soluble protein) are 
consistent with the results obtained with pHisBotA protein (Ex. 25): the pHisBotA construct, 
like the pET21 -derived vector, contains the T7lac rather than T7 promoter. 

The 6HisBotA protein thus elutes under less stringent conditions than the I OX 
Imtidinc-coniaining pHisBot protein ( 100-200 mM imidazole: Ex. 25) presumably due to the 
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reduction in the length of the his-tag. The eluted protein was of the predicted size [i.e„ 
slightly reduced in comparison to pHisBotA protein]. 

. ; '■ ■ ' • '' . i 

b) feneration And Characterization Of Hyperimmune Serum 
Eight BALBc mice were immunized with purified 6llisBotA protein using Gerbu 
GMDP adjuvant (CC Biotech). The 40 mM imidazole elution was mixed with Gerbu 
adjuvant and used to immunize mice. Each mouse received a subcutaneous injection of 100 
ul antigen/adjuvant mix (12 ug antigen + I ug adjuvant) on day 0. Mice were subcutancouslv 
boosted as above on day 14 and bled on day 28. Control mice received pHisBotB protein 
(prepared as described in Ex. 35 below) in Gerbu adjuvant. 

Anti-C. hotulmum serotype A toxoid titers were determined in serum from individual 
mice from each group using the ELISA described in Example 23a with the exception that the 
initial testing serum dilution was 1:100 in blocking buffer containing 0.5% [ ween 20. 
followed by serial 5-fold dilutions into this buffer. The results of the LI.ISA demonstrated 
that seroconversion (relative to control mice) occurred in all 8 mice. 

The. ability of the anti-f . hotulinwn serotype A C fragment antibodies present in .serum 
from the immunized mice to neutralize native C. holulinum type A toxin was tested using the 
mouse neutralization assay described in Example 23b. The amount of neutralizing antibodies 
present in the serum of the immunized mice was determined using scrum antibody titrations. 
The various serum dilutions (0.01 ml) were mixed with 5 LD<„ units of C. botulinum type A 
toxin and the mixtures were injected IP into mice. The neutralizations were performed in 
duplicate. The mice were then observed for signs of botulism for 4 days. Undiluted serum 
was found to protect 100% of the injected mice while the 1:10 diluted .serum did not. This 
corresponds to a neutralization titer of 0.05-0.5 lU/ml. 

These, results demonstrate that neutralizing antibodies were induced when the 
6HisBoiA protein was utilized as the immunogen. Furthermore, these results demonstrate that 
seroconversion and the generation of neutralizing antibodies docs not depend on the specific 
N terminal extension present on the recombinant C. baiulumm type A C fragment proteins. 
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EXAMPLE 30 

Construction Of Vectors For The Expression Of His-Tagged 
C. hoiulinum Type A T oxin C Fragment Protein Using the 'synthetic Gene 

A number of expression vectors were constructed which contained the synthetic C. 
houdhmm type A toxin C fragment gene. These constructs vary as to the promoter (T7 or 
T7lac) and repressor elements (laclq) present on the plasmid. The T7 promoter is a stronger 
promoter than is the T71ac promoter. The various constructs provide varying expression 
levels and varying levels of plasmid stability. This example involved a) the construction of 
expression vectors containing the synthetic C. botulinum type A C fragment gene and b) the 
determination of the expression level achieved using plasmids containing either the kanamycin 
resistance or the ampicillin resistance genes in small scale cultures. 

a) Construction Of Expression Vectors Containing The 
Synthetic C. botulinum Type A C Fragment Gene 

Hxpression vectors containing the synthetic C. hoiulinum type A C fragment gene were 
engineered to utilize the kanamycin resistance rather than the ampicillin resistance gene. This 
was done for several reasons including concerns regarding the presence of residual ampicillin 
in recombinant protein derived from plasmids containing the ampicillin resistance gene. In 
addition, ampicillin resistant plasmids are more difficult to maintain in culture: the p- 
laclamase secreted by cells containing ampicillin resistant plasmids rapidly degrades 
extracellular ampicillin. allowing the growth of plasm id-negative cells. 

A second altered feature of the expression vectors is the inclusion of laclq gene in the 
plasmid. This repressor lowers expression from lac regulated promoters (the chromosomally 
located, lactose regulated T7 polymerase gene and the plasmid located T7lac promoter). This 
down regulates uninduced protein expression and can enhance the stability of recombinant cell 
lines. The final alteration to the vectors is the inclusion of either the T7 or T7lac promoters 
that drive high or moderate level expression of recombinant protein, respectively. 

The expression plasmids were constructed as follows. In all cases, the protein 
expressed is the pHisBotA(syn) protein previously described, and the only differences between 
constructs is the alteration of the various regulatory elements described above. 
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i) Construction Of pHisBotA(syn) kan T7iac 

The pHisBotA(syn) kan T71ac construct was made by inserting the Sapl/Xhol fragment 
containing the C hatulinum type A C fragment from pHisBotA(syn) into pET24 digested with 
Sapl/Xhol (Novagcn: fragment contains kan gene and origin of replication). The desired 
construct was selected for kanamycin resistance and confirmed by restriction digestion. 

ii) Construction Of plIisBotA(syn) kan laclq T7lac 

The pHisBotA(syn) kan laclq T71ac construct was made by inserting the A7«,l///md||| 
fragment containing the C. bondman, type A C fragment from pllisBotA(syn)kanT71ac into 
the P F.T24a vector digested with Xhal/HmdlU. The resulting construct was confirmed by 
restriction digestion. 

Hi) Construction Of pHisBotA(syn) kan laclq T7 
The pl lisBotA(syn) kan laclq T7 construct was made by inserting the Xhal/Hindlll 
fragment containing the C. bolulinum type A C fragment from pHisBotA(syn) kan laclq T7lac 
into A7 3c ,|///,,,dIII-digested pHisBotB(syn) kan laclq T7 {described in lix 37c below). The 
resulting construct was confirmed by restriction digestion. 

b) Determination Of The Expression Level Achieved Using 

I'lasmids Containing Either The Kanamycin Resistance Or 
The Ampicillin Resistance Genes In Small Scale Cultures 
One liter cultures of pHisBotA(syn) kan T7lac/BI2l(DE3)pl.ysS and pHisBoiA(syn) 
amp T7lac/B121(DE3)pLysS (this is the previously designated pllisBotA(syn» construct! were 
grown, induced and his-tagged proteins were purified as described in Example 28: No 
differences in yield or protein integrity/purity were observed. 

These results demonstrate that the antigen induction levels from expression constructs 
were not affected by the choice of ampicillin versus kanamycin antibiotic resistance genes. 
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EXAMPLE 31 

Fermentation Of Cells Expressing Recombinant Botulinal Proteins 

a) Fermentation Culture Of Cells repressing Recombinant 
Botulinal Proteins 

Fermentation cultures were grown under the following conditions which were 
optimized for growth of the BL21(DE3) strains containing pFT derived expression vectors. 
An overnight 1 liter feeder culture was prepared by inoculating of I liter media (in a 2L 
shaker tlask) with a fresh colony grown on an LB kan plate. The feeder culture contained: 
600 mis nitrogen source [20 gm yeast extract (BBL) and 40 gm tryptone (BBL)/600 mis], 200 
mis 5X fermentation salts (per liter: 48.5 gm K : HP0 4 . 12 gm NafLP0 4 *H,0. 5 gm NH 4 CI. 
2.5 gm NaCT). 180 mis dH : C). 20 mis 20% glucose, 2 mis 1 M MgS0 4 , 5 mis '0.O5M CaCK 
and 4 mis of a 10 mg/ml kanamycin stock. Ail solutions were sterilized by autoclaving. 
except t he kanamycin stock which was filter sterilized. 

An aliquot (5 ml) of the feeder culture broth was removed prior to inoculation, and 
grown for 2 days at 37°C as a culture broth sterility control. Growth was not observed in this 
control culture in any of the fermentations performed. 

Hie inoculated feeder culture was grown for 12-15 hrs (ON) at .>0-37°C\ Care was 
taken to prevent ovcrsaturation of this culture. The saturated feeder culture was added to 10L 
of fermentation media in fermcnter (BiofloIV. New Brunswick Scientific. -Edison. NJ) as 
follows. The fermenter was sterilized 120 min at 121°C with dH,0. The sterile water was 
removed, and fermentation media added as follows: 6 liters nitrogen source. 2 liters 5X 
fermentation salts. 2 liters. 2% glucose. 20 mis 1 M MgS0 4 . 50 mis 0.05 M CaCL. 2.5-3.5 
mis Macol P 400 antifoam (PPG Industries Inc.. Gurnee. IL). 40 mis lOmg/ml kanamycin and 
10 mis trace elements (8 gm FeS0 4 *7H : 0. 2 gm MnS0 4 *H,O. 2 gm AICI^HXK 0.8 gm 
CoCI-6IllO. 0.4 gm ZnS0 4 -7H 3 0. 0.4 gm Na : MoO/2H 7 0. 0.2 gm CuCl/2HX>, 0.2 gm 
NiCL. 0.1 gm H.,BO 4 /200mls 5 M HCl). All solutions were sterilized by autoclaving. except 
the kanamycin stock which was filter sterilized. Fermentation media was prewarmed to 37°C 
before the addition of the feeder culture. 

After the addition of the feeder culture, the culture was fermented at 37°C. 400 rpm 
agitation, and 10 l/min air sparging. The DO : control was set to 20% PID and dissolved 
oxygen levels were controlled by increasing the rate of agitation from 400-850 rpm under 
IX) : control. DO, levels were maintained at greater than or equal to 20% throughout the 
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entire fermentation. When agitation levds reached 500-600 rpm the temperature was lowered 
to 30»C ,o reduce the oxygen consumption rate. Culture growth was continued until 
endogenous carbon sources were depleted. In these fermentations, glucose was depleted first 
[momtored with a glucose monitoring kit (Sigma)], followed by assimilation of acetate and 
other acdic carbons Imonitored using an acetate test kit (Boehringer Mannheim)!. During the 
ass.mi.aUon phase, the P H rose Item 6.6-6.8 (starting pH) to 7.4-7.5. at which time the bulk 
of the remaming carbon source was depleted. This was signaled by a drop in auction rate 
(from a maximum of 700-800 rpm) and a rise in DO ; levels >30%. This corresponds to a 
OD„„ reading of 18-20/ml. At this point a fed batch mode was initiated, in which a teed 
so.ut.on of 50 o/ o giucose was added at a rate of approximately 4 gm glucose/.iter/hr The P H 
was adjusted to 7.0 by the addition of 25% H ; PO, (approximately 60 mls) . Culture growth 
was contmued and reached peak oxygen consumption within the next 3 hrs of growth (while 
the remammg residual non-glucose carbon sources were assimilated). This phase is 
characterized by a slow increase in P H. and air sparging was increased to 15L/min to keep 
the maximum rpm below 850. Once the residual acidic carbon purees are depleted the 
ag.tauon rate decreases to 650-750 rpm and the P H begins ,o drop. pH con tro. was 
mamtained a, 7.0 PID by regulated pump adduion of a sterile 4M NaOII solution which was 
consumed a«.a steady rate for the remainder of the fermentation. Growth was continued at 

30-C. and the cultures were grown linearly at a growth rate of 4-7 OD M>1 units/hr. to at .east 
81.5 OD wl(1 Ullhs/ml (>30g/J dry ccl| we . ght) wkhoui induct . on Ant . foam ^ fl ^ ^ 

Miter sterilized 100% ethano.) was added as necessary throughout the fermentation to prevent 

foaming. 

During the fed batch mode, glucose was assimilated immediately (concentration in 
media consistently less than 0,1 gm/, iter) and acetate was not produced ,n significant levels bv 
the pnr plasmid/BL2l(DE3) cell lines tested (approximately I gm/hter at end of 
fermentation: this is lower than that observed in harvests from shaker flask cultures utilizing 
the same strains). This was fortuitous, since high levels of acetate has been shown lo inhibit 
mducuon levels in a variety of expression systems. The above described conditions were 
found to be highly reproducible between fermentations and utilizing different expression 
plasmids. As a result, glucose and acetate level monitoring were no longer preformed during 
fermentation. 
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b) Induction Of Fermentation Cultures 

induction with IPTG (250 mg-10 gms. depending on the expression vector and 
experiment) was initiated 1-3 hrs after initiation of the glucose feed (io-50 OD W)t /mI). The 
growth rate after induction was monitored on a hourly basis. Aliquots (5-10 ml) of cells were 
harvested at the time of induction, and at hourly intervals post-induction. Optical density 
readings were determined by measuring the absorbance at 600 nm of 10 f.il culture in 990 \i\ 
PBS versus a PBS control. The growth rate after induction was found to vary depending on 
the expression system utilized. 

c) Monitoring Of Fermentation Cultures 

fermentation cultures were monitored using the following control assays. 

i) Colony Forming Ability 

An aliquots of cells were removed from the cultures at each timepoint sampled 
(uninduced and at various times after induction) were serially diluted in PBS (dilution 1 = 15 
Ml cells/3 ml PBS. dilution 2 - 1 5 yd of dilution 1/3 ml PBS. dilution 3 - 3 or 6 yii of 
dilution 2/3mls PBS) and 100 |aI of dilution 3 was plated on an LB or TSA (trypticase soy 
agar) plate. The plates were incubated ON at 37°C and then the colonies are counted and 
scored for macro or micro growth. 

ii) Phenotypic Characterization 

Colonies growing on LB or TSA plates (above) from uninduced and induced 
limepoints were replica plated onto LB+kan, LB+chloramphenicol (for fermentations utilizing 
LysS or pACYCGro plasmids). LB+kan+lmM IPTG and LB plates, in this order. The plates 
were grown 6-8 hrs at 37°C and growth was scored on each plate for a minimum of 40-50 
well isolated colonies. The percentage of cells retaining the plasmid at time of induction (i.e.. 
uninduced cultures immediately prior to the addition of IPTG) was determined to be the U 
colonics LB+Kan tor chloramphenicol) plate/# colonics LB plate X 100%. The percentage of 
cells with mutated pET plasmids was determined to be the H colonies LB+Kan+IPTG plate/// 
colonics LB plate X 100%. Colonies on all LB plates were scored morphologically for £". 
cn/i phenotype as a contamination control. Morphologically detectable contaminant colonies 
were not detected in any fermentation. 
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iii) Recombinant BotA Protein Induction 
• A total ofl0OD wl units of cells 200 u. of cells at OD N „=50/m«) were removed 
from each timepoin, sample to a ..5 ml microfuge tube and pelleted for 2 mi„ al maximum 
rpm ,„ a microfuge. The pellets were re SUS pendcd in I ml of 50 mM NallPO, 0 5 M NaCl 
40,„M imidazole buffer (pH 6.8, containing I mg/ml ,vso 2 yme. The samples were incubated 
»or 20 mm a, room temperature and stored ON a, -70°C. Samples were thawed completely at 
room temperature and sonicated 2 X .0 seconds with a Branson SonUier 450 microtip probe 
at U 3 power setting. The samples were centrifuged for 5 min a, maximum rpm in a 
microiuge. 

An aliquot (20 ul> of the protein samples were removed to 20 ul 2X sample buffer 
before or after centrifugation. for total and soluble protein extracts, respectively The samples 
were heated to ^ for 5 mm thcn coo|ed and 5 ^ ]Q ^ ^ ^ ^ ^ 

SDS-PAGF. gels. High molecular weight protein markers (BioRad, were also .oaded to al.ow 
lor csumation of the MW of identified fusion proteins. After electrophoresis, protein was 
detected ether generally by staining gels with Coomassie blue, or specified, bv blotting 
onto nnrocellulose (as described in Lx. 28) for Western Wot detection of specific lm-,agged 
pro.e.ns utilizing a NiN TA-alkalinc phosphatase conjugate exactly as described bv ,he 
manufacturer (Qiagcn). 

•v, Recombinant Antigen Purification 

At the end of each fermentation run. 1-10 liters of culture were harvested from the 
•ermenter and the bacterial cells were pelleted by centrifugation at 6000 rpm for .0 min in a 
M.O rotor (Beckman). The cell pel.ets were stored frozen at -70°C or ut.lized immediate.v 
wahout freezing. Cell pellets were resuspended to 15-20% weigh, to volume ,„ ^suspension 
butter (generally 50 mM NaPO,. 0.5 M NaCl. 40mM imidazole. P H 6.8) and lysed util.zing 
either sonication or high pressure homogenization. 

I or sonication. the ^suspension buffer was supplemented with lysozvme to I mg/ml 
and the suspension was incubated for 20 min. at room temp. The sample was then frozen ON 
at -70°C. thawed and sonicated 4 X 20 seconds at microtip maximum to reduce viscositv 
For homogenization. the cells were lyzed by 2 passes through a homouenizer (Rannie 
MinMab type 8.30 H) at 600 Bar. Cell lysates were clarified by centrifugation for 30 min at 
10.000 rpm in a JAI0 rotor. 
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For IDA chromatography, samples were flocculated utilizing polyethyleneimine (PEI) 
prior to centrifugation. Cell pellets were resuspended in cell resuspension buffer (CRB: 50 
mM NaP0 4 , 0.5 M NaCL 40 niM imidazole. pH 6.8) to create a 20%' cell suspension (wet 
weight of cells/volume of CRB) and cell iy sates were prepared as described above (sonication 
or homogenization). PEI (a 2% solution in dHXX pH 7.5 with HCI) was added to the cell 
lysate a final concentration of 0.2%. and stirred for 20 min at room temperature prior to 
centrifugation (8.500 rpm in JA10 rotor for 30 minutes at 4°C). This treatment removed 
RNA. DNA and cell wall components, resulting in a clarified. low viscosity lysate ("PEI 
clarified lysate"). 

His-tagged proteins were purified from soluble lysates by metal-chelatc affinity 
chromatography using either a NiNTA resin (as described in Ex. 28) or an IDA (iminodiacetic 
acid) resin as described below. 

IDA resin affinity purifications were performed utilizing a low pressure 
chromatography system (ISCO). A 7 ml (small scale) or 70 ml (large scale) Chelating 
Sepharose Fast Plow (Pharmacia) affinity column was poured; in addition, a second guard 
column was poured and attached in line with the first column (to capture Ni ions that leached 
off the affinity column). The columns were washed with 3 column volumes of dM,0. The 
guard column was then removed and the affinity column was washed with 0.3 M NiS0 4 until 
resistivity was established, then with dIKO until the resistivity returned to baseline. The 
columns were reconnected and equilibrated with cell resuspension buffer (CRB; 50 mM 
NaPO,. 0.5 M NaCL 40 mM imidazole. pH 6.8). The clarified sample (in CRB) was loaded. 
Mow rates were 5 ml/min for small scale columns and 20 ml/mi n for large scale columns. 
After sample loading, the column was washed with CRB until a baseline established and 
bound protein was eluted with cluiion buffer (50 mM NaP0 4 . 0.5 M NaCI. 800 mM 
imidazole. 20% glycerol. pH 6.8 or 8.0). Protein samples were stored at 4°C or -20°C. The 
yield of eluted protein was established by measuring the OD :jh , of the elutions. with a 1 mg/ml 
solution of protein assumed to yield an absorbance reading of 2.0. 

The IDA columns may be regenerated and reused multiple times (>I0). To regenerate 
the column, the column was washed with 2-3 column volumes of M : 0. then 0X)5 M FDTA 
until all of the blue/green color was removed followed by a wash with dH.O. The IDA 
columns were sterilized with 0.1 M NaOH (using at least 3 column volumes but not more 
than 50 minutes contact time with column packing material), then washed with 3 column 
volumes 0.05 M NaPQ 4 . pH 5.0. then dll.O and stored at room temperature in 20 % ethanol. 
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EXAMPLE 32 

Construction Of A Folding Chaperone Overexpression System 

Co-ovcrexpression of the E. coli OroEL/GroES folding chaperones in a ceil expressing 
a recombinant foreign protein has been reported to enhance the solubility of .some foreign 
proteins that are otherwise insoluble when expressed in E. coli [Gragerouu e, a/. ( I99<>) P roc 
Natl. Acad. Sci. USA 89:10344]. The improvement in solubility is thought 'to be due to 
chaperone-mediated binding and unfolding of insoluble denatured proteins, thus allowing 
multiple attem p ts for productive refolding of recombinant proteins. By overexpressing the 
chaperones. the unfolding/refolding reaction is driven by excess chaperone. resulting, in some 
cases, in higher yields of soluble protein. 

In this example, a chaperone overexpression system, compatible with pET vector 
expression systems, was constructed to facilitate testing chaperone-mediated solubilization of 
C hoiuiimm type A proteins. This example involved the cloning of the GroEL/ES operon 
and construction of a pl.ysS-based chaperone hypercxpression system. 

The GroEL/GroES operon was PCR amplified and cloned in.o the pCRScript vector as 
described in Example 28. The following primer pair was used: 5 -C GCAT 
AJGAATATTCGTCCATTGCATG-3- (SEQ ID NO: 37) [5" primer, star, codon of groES 
gene convened to NJe\ site (underlined)) and 5 -GGAAGCJTCiC AGGGCAA7 7>f CATCATG 
(SEQ ID NO:38) (3* primer, stop codon of groEL gene italicized, engineered //,„dlll site 
underlined). Following amplification, the chaperone operon was excised as an ,YV/ t .l///,„dlll 
fragment and cloned into P ET23b digested with M/,l and ///milll. This construction places 
the Gro operon under the control of the T7 promoter of the pET23 vector. The desired 
construct was confirmed by restriction digestion. 

The 17 promoter-Gro operon-T7 terminator expression cassette was then excised as a 
B K l\\llisp\L\ (filled) fragment and cloned into BamH\ (compatible with %/II)///»7dIlI (filled) 
cleaved pLysS plasmid (this removed the T7 l yS0 7.yme gene). The resulting construct was 
designated pACYCGro. since the plasmid utilizing the pACYCI84 origin from ,he plysS 
plasmid. Proper construction was confirmed by restriction digestion. 

PACYCGro was transformed into BL2I(DE3). cultures were grown and induced with 
I mM IPTG as described in preceding examples. Total and soluble protein extracts were 
generated from cells removed before and after IPTG induction and were resolved on a 12.5 % 
SDS-PAGE gel and stained with Coomassie blue, t his analysis revealed that high levels of 
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soluble GroEl and GroES proteins were made in the induced cells. These results 
demonstrated that the chaperone hypcr-cxpression system was functional. 

EXAMPLE 33 

Growth Oi BotA/pACYCGro Cell Lines In Fermentation Cultures 

Induction of BL21(DE3> cells lacking the LysS plasmid which contained BotA 
expression constructs grown in shaker flask or fermentation culture resulted in the expression 
of primarily insoluble BotA protein, f ermentation cultures were performed to determine if 
the simultaneous overexpression of the Gro operon and recombinant C*. hand mum type A 
proteins (BotA proteins) resulted in enhanced solubility of the recombinant BotA protein. 
• This, example involved the fermentation of pHisBotA(syn)kan laclq T7]ac/pACYCGro 
BL2KDE3) and pHisBotA(syn)kan laclq TTVpACYCGro BL2KDE3) cell lines. The 
fermentations were repeated exactly as described in Example 3 1 . Chloramphenicol (34 
jig/ ml) was included in the feeder and fermentation cultures. 

a) Fermentation Of pHisBot A(svn)kan laclq TZIac/pACYCGro 
BL21(I)E3) Cells 

for fermentation of cells containing plasmids comprising the T7luc promoter, 
induction was with 2 yms IPTG at 1 hr post initiation of glucose feed. The OD MNI was 35 at 
time of induction, then 48.5. 61.5. 67 at 1-3 hrs post induction. Viable colony counts 
decreased from 0-3 hr induction [21 (13). 0, 0. 0: dilution 3 utilized 3 ul of dilution 2 cells] 
with numbers in parenthesis for the indicating microcolonies. Of 28 colonies scored at the 
time of induction. 23 retained the pi lisBotA(syn)kan laclq T7lac plasmid (kan resistant). 22 
contained the chaperone plasmid (chloramphenicol resistant) and no colonies at induction 
grew on IPTG+K.an plates <no mutations delected). These results were indicative of very 
strong promoter induction, since colony viability dropped immediately after induction. 

total and soluble extracts were resolved on a 12.5% SOS-PAGE gel and stained with 
Coomassie. High level induction of Gro chaperones was observed, but very low level 
expression of soluble BotA protein was observed, increasing from I to 4.0 hrs post induction 
(no expression detected in uninduced cells). The dramatically lower expression of the BotA 
antigen in the presence of chaperone may be due to promoter occlusion [i.e.. the stronger T7 
promoter on the chaperone plasmid is preferentially utilized). 
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b) Fermentation Of pHisBotA( S yn)kan laclq T7/ pACVCGro 
BL21(DE3) Cells 

A fermentation utilizing the T7-driven BolA expression plasmid was performed 
Induetion was with . gm IPTG at 2 hrs post initiation of glucose feed. The OD„„ was 4. at 

°' ,ndUC,i ° n - the " 51 5 - 6 «-5« 615 -«« <* ,-4 hrs p« sl i„ duction . Vi ab.e colony 
counts decreased from 0-4 hrs induction [71. I (34). ,,„.,. 0: UiIution 3 utilized fi . 
d.luuo„ 2 ce«.s, with numbers in parenthesis for the uninduced timepoint indicating 
nucroco.onie, Of 65 colonies scored at the time of induction, a., 65 retained both the 
P H, S Bo,A<syn,kan laclq T7 plasmid (kan resistant, and the chaperonc plasmid 
(chloramphenicol resistant) and no colonies at induetion grew on IPTG+Kan plates (no 
mutations detected). 

Total and soluble extracts were resolved on a .2.5% SDS-PAGH gel and stained with 
Coomass,. High .eve, induction of Gro chaperones and moderate level expression of soluble 

Pr0,Cin WaS ° bSCrVed - *"* ' to 4.0 hrs post induction (no expression 

detected m uninduced cells). 

A PLI-clarified lysate (0.2% final cocnentration PF.I , [850 ml from 1 30 g m cell pellet 
(2 hters lermentation harvest,) was purified on a large scale IDA column. A total of 78 mu 
ul pr»«e,n was eluted. Fxtracts from the purification were resolved on a P 5% SDS PAGH 
gel and stained with Ooomass.e. The e.ution was found to contain an approx.mate.v I , mix 
«l Bo,A,chapcrone protein (Figure 32). PEI lysates prepared in this manner were tvpicallv 16 
ODwml. This was estimated to be 8 mg protein/ml of lysate ( bv BCA assav , Thus the 
eluted recombinant UotA protein represented 0.55% of the total soluble cellular protein 
applied to the column. 

I" Figure 32. lane I contains molecular weight markers, lanes 2-9 contain extracts 
from P HisBotA(svn,kan laclq T7/pACYCGro/13L2 1 (DF3) cells before or durinu purification 
on the IDA column. Lane 2 contains total protein extract: lane 3 contains soluble protein 
extract: lanes 4 and 5 contain PFl-elarified lysates (duplicates,: lanes 6 and 7 contain .low- 
Ihrough from the IDA column (duplicates, and lanes 8 and o contain IDA column elute <>ane 
( ) contains 1/10 the amount applied to lane K). 

These results demonstrate, that although the majority of the Uo.A protein produced 
was insoluble. 20 mg/liter of soluble recombinant UotA protein can be punfied utilising the 
pH,sBotA(syn)kan lack, I7/pACYCGro/BL2l(DE3) expression svstem. 
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EXAMPLE 34 

Purification Of Recombinant BotA Protein From Folding Chaperones 

.- ■ ■. ■ ■ I 

In this example of size exclusion chromatography was used to purify the recombinant 
BotA protein away from the folding chaperones and imidazole present in the IDA-purified 
material (Ex. 3.1). 

To enhance the solubility of the recombinant BotA protein during scale-up. the protein 
was co-expressed with folding chaperones (Ex. 33). As observed with the recombinant BotB 
protein (Example 40 below), the folding chaperones co-eluted with the recombinant BotA 
protein during the Ni-IDA purification step. Because the recombinant BotA and BotB 
proteins have similar molecular weights (about 1/10 the size of the non-reduced folding 
chaperone) and the imidazole step gradient strategy was unsuccessful in purifying BotB away 
from the folding chaperone (see Ex. 40). size exclusion chromatography was examined for the 
ability to -purify the recombinant BotA protein away from the folding chaperones. 

A column (2.5 x 24 cm) containing Sephacryl S-100 HR (Pharmacia) was poured (bed 
volume 110 ml). Proteins having molecular weights greater than 100 K are expected to elutc 
in the void volume under these conditions and smaller proteins should be retained by the 
beads aud eluie at different times, depending on their molecuiar weights. To maintain 
solubility of the purified BotA protein, the Sephacryl column was equilibrated in a buffer 
having the same salt concentration as the buffer used to e lute the BotA protein from the IDA 
column </.c . 50 mM sodium phosphate. 0.5 M NaCI. 10% glycerol: all reagents from 
Mallinkrodt. Chesterfield. MO). 

l ive milliliters ol the IDA-purified recombinant BotA protein (Ex. 33) was filtered 
through a 0.45 y. syringe filter, applied to the column and the equilibration buffer was 
pumped through the column at a flow rate of I ml/minute. Eluted proteins were monitored 
by absorbance at 280 nm and collected either manually or with a fraction collector (BioRad). 
Appropriate fractions were pooled, if necessary, and the protein was quantitated by absorbance 
at 280 nm and/or BCA protein assay (Pierce). The isolated peaks were then analyzed by 
native and/or SDS-PAGE to identify the proteins present and to evaluate purity. The foldinu 
chaperone eluted first, followed by the recombinant BotA protein and then the imidazole 
peak. 

SDS-PAGE analysis (12.5% polyacrylamide. reduced samples) was used to evaluate 
the purity of the IDA-purified recombinant BotA protein before and after S-100 purification. 
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Figure 33 shows the difference in purity before and after the S-100 purification step. f„ 
Figure 33. lane I contains molecular weight markers (BioRad broad range). Lane 2 shows 
the IDA-purified recombinant BotA protein preparation, which is contaminated with 
significant amounts of the folding chaperone. Following S-KK) purification, the amount of 
»olding chaperone present in the BotA sample is reduced dramatically (lane 3). Lane 4 
contains no protein (/.,.. i, is a blank lane): lanes 5-8 contain samples of IDA-purified 
recombinant BotB and BotE proteins and arc discussed infra. 

Endotoxin levels in the S-100 purified BotA preparation were determined using the 
LAI. assay (Associates of Cape Cod) as describe in Example 24. The purified BotA 
preparation was found to contain 22.7 to 45.5 EU/mg recombinant protein. 

These results demonstrate that size exclusion chromatography was successful in 
..purifying, the recombinant BotA protein from folding chaperones and imidazole following an 
m.t,al IDA purification step Furthermore, these results demons.ra.e that the S-100 purified 
BotA protein was substantially free of endotoxin. 

EXAMPLE 35 

Cloning And Expression Of The C Fragment 
Of The C. hotti/imim Serotype B Toxin Gene 

The C. ho.ulinum type B neurotoxin gene has been cloned and sequenced (Whelan ct 
ol. (1992) Appl. Environ. Microbiol. 58:2345 and Hutson ct ul. (1994) Curr. Microbiol. 
28: 101 1. I hc nucleotide sequence of the toxin gene derived from the Eklund I 7B strain 
( ATCC 25765) ,s available from the EMBL/GenBank sequence data banks under the 
accession number X7I343: the nucleotide sequence of the coding region is listed in SEQ ID 
NO:39. The amino acid sequence of the C. hoiulinum type B neurotoxin derived from the 
strain Eklund 17B is listed in SEQ ID NO:40. The nucleotide sequence of the C hon.Hnun, 
serotype B toxin gene derived from the Danish strain is listed in SEQ ID NO:4l and the 
corresponding amino acid sequence is listed in SEQ ID NO:42. 

I hc DNA sequence encoding the native C. hoiulinum serotype B C fragment gene 
derived from the Eklund I7B strain can be expressed using the pETI Iisb vector: the resulting 
coding region is listed in SEQ ID NO:43 and the corresponding amino ac.d sequence is listed 
in SEQ ID NO:44. The DNA sequence encoding the native C. hoiulinum serotype B C 
fragment gene derived from the Danish strain can be expressed using the pETHisb vector: the 
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resulting coding region is listed in SEQ ID NO:45 and the corresponding amino acid sequence 
is listed in SEQ ID NO:46. The C frgament region from any strain of C. hotulinum serotype 
B can be amplified and expressed using the approach illustrated below using the C fragment 
derived from C. hotulinum type B 2017 strain. 

The C hotulinum type B neurotoxin gene is synthesized as a single polypeptide chain 
which is processed to form a dimer composed of a light and a heavy chain linked via 
disulfide bonds: the type B neurotoxin has been reported to exist as a mixture of predominate 
single chain with some double chain < Whelan et <//.. supra). The 50 kD carboxy-tcrminal 
portion of the heavy chain is referred to as the C fragment or the H t - domain. Expression of 
the C fragment of C. hotulinum type B toxin in heterologous hosts (c E. cofi) has not been 
previously reported. 

The native C fragment of the C. hotulinum serotype B toxin gene was cloned and 
expression constructs were made to facilitate protein expression in L\ colt. This example 
involved PCR amplification of the gene, cloning, and construction of expression vectors. 

The C fragment of the C, hotulinum serotype B (BotB) toxin gene was cloned using 
the protocols and conditions described in Example 28 for the isolation of the native BotA 
gene. The C \ hotulinum type B 2017 strain was obtained from the American Type Culture 
Collection (ATCC £17843). The following primer pair was used to amplify the BotB gene: 
5 -C( iCCATGGCTGATACAATACI A ATAGAA ATG-3* [5' primer, engineered Ncol site 
underlined (SEQ-ID NO:47)| and 5 -GC AAG CTT 77V1 TTCAGTCC A f YTTTf ' a TP-V \y 
primer, engineered HindlU site underlined, native gene termination codon italicized (SEQ ID 
NO:48)|. Alter cloning into the pCRscript vector, the A/rcK filled )/ Hindi 1 1 fragment was 
.cloned into -pETHisb vector as described for BotA C fragment gene in Example 28. The 
resulting construct was termed pHisBotB. 

pHisBotB expresses the BotB gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
pHisBotB expression construct was transformed into BL2l(DE3) pLysS competent cells and I 
liter cultures were grown, induced and his-tagged proteins were purified utilizing a NiNTA 
resin (eluted in low pll elution buffer) as described in Example 28. Total, soluble and 
purified proteins were resolved by SDS-PAGE and detected by Coomassie staining and 
Western blot hybridization utilizing a chicken anti-C hotulinum serotype B toxoid primary 
antibody (generated by immunization of hens using C. hotulinum serotype B toxoid as 
described in Example 3). Samples of BotA and BotE C fragment proteins were included on 
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■he gels for MW and immunogenic^ comparisons. Strong immunoreactivitv to only the 
BotB protein was detected with the anti-< ; serotype B toxoid antibodies The 

recombinant BotB protein was expressed at ,ow levels (3 mg/>iter> as a soluble protein The 
punfied BotB protein migrated as a single band of the predicted MW (i.e.. - 50kD) 

These results demonstrate the cloning of the native C ; hotulinum serotype B'C 
•ragmen, gene, the expression and purification of the recombinant BotB protein as a soluble 
Jus-tagged protein in E. voli. 

EXAMPLE 36 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBolB Protein 

The ability of the purified pHisBot protein to generate neutralizing antibodies was 
examined. Nine BALBc mice were immunized with BotB protein (purified as described in 
l-x. 35 > using Cicrbu GMDP adjuvant (CC Biotech , The low p, , eIution was mixed wim 
Cicrbu adjuvant and used to immunize mice. Each mous, received a subcutaneous Section 
ol 100 Ml antigen/adjuvant mix ( .5 M g antigen + 1 M g adjuvant, on dav (, Mice were 
subcutaneously boosted as above on day .4 and bled on day 28. Mice were subscqucn.lv 
boosted I -Z weeks alter bleeding and were then bled on day 70. 

Anti-r. /,„/„/„„„„ scrotvpe B Ioxojd Ihcrs werc dctcrmjncd jn day 2f{ sen|m 
individual ,„ice from each group using the HLISA protocol outlined in Sample 29 with the 
except™ that the p.ates were coated with C. hon.linum serotype B toxoid, and the primarv 
ant,bod> was a chicken ant.-C W/™„ serotype B toxoid. Seroconversion |relative to ' 
contro. mice immunized with pHisBotH antigen (described below, | was observed w,th all o 
mice immunized with the purified pHisBotB protein. 

The ability of the anti-BotB antibodies to neutralize native C W,,„,„, tV pe B toxin 
was tested in a mouse-C W,,„„„ neutralization mode, using pooled mouse serum (see 1.x 
23b). I he L D<(1 of purified C. ho.ulinmn type B toxin complex (Dr. Lric Johnson Universitv 
of Wisconsin. Madison, was determined by a intraperitoneal (IP) method fSchantz and Kautler 
( 1978). .„„„,„ usmg 18-22 g female ICR mice. The amount of neutralizing antibodies present 
m the serum ol the immunized mice was determined using serum antibodv titrations The 
various serum di.utions (U.O. ml, were m.xed with 5 LD. umts o, C. A,,,,,/,,,,,,,, lvpe ft toxin 
and the mixtures were injected ,P into m.ce. I he neutra.izations were performed in duplicate 
I he mux- were then observed tor signs of botulism for 4 days. Undiluted serum ,dav ^8 or 
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day 70) was found to protect J 00% of the injected mice while the 1:10 diluted serum did not. 
This corresponds to a neutralization titer of 0 05-0.5 lU/ml. 

These results demonstrate that seroconversion occurred and neutralizing antibodies 
were induced when the pi lisBotB protein was utilized as the immunogen. 

EXAMPLE 37 

Construction Of Vectors To Facilitate Impression 
Of Ilis-Taggcd BotB Protein In Fermentation Cultures 

A number of expression vectors were constructed to facilitate the expression of 
recombinant BotB protein in large scale fermentation culture. These constructs varied as to 
the strength of the promoter utilized (T7 or T7lac, and .he presence of repressor elements 
(laclc,, on the plasmid. The resulting constructs varied in the level of expression achieved and 
in plasmid stability which facilitated the selection of a optimal expression system for 
fermentation scaleup. 

The BotB expression vectors created for fermentation culture were engineered to 
utilize the kanamycin rather than the ampicillin resistance gene, and contained cither the T7 or 
T7lac promoter, with or without the laclq gene for the reasons outlined in Example 30. 

In all cases, the protein expressed by the various expression vectors is the pHisBot B 
protein described in Example 35. with the only differences between clones being the alteration 
of various regulatory elements. Using the designations outlined below, the pHisBotB clone 
(Ex. 35) is equivalent to pHisBotB amp T7lac. 

a) Construction Of pHisBotB kan T7lac 

pHisBotB kan T71ac was constructed by insertion of the Bg/WHimilU fragment of 
pHisBotB which contains the BotB gene sequences into the pPA 1 870-2680 kan T7lac vector 
which had been digested with B K M and //mdlll (the pPA 1 870-2680 kan T7lac vector contains 
the pET24 kan gene in the P ET23 vector, such that no laclq gene is present). Proper 
construction of pHisBotB kan T7lac was confirmed by restriction digestion. 
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b) Construction Of pHisBotB kan laclq T7lac 

pHisBotB kan laclq T7lac was constructed by insertion of the Bgl\\l HindUX fragment 
of pHisBotB which contains the BotB gene sequences into similarly cutlpHT24a vector 
Proper construction of pHisBotB kan laclq TTIac was confirmed by restriction digestion. 

c) Construction Of pHisBotB kan laclq T7 

pHisBotB kan laclq T7 was constructed by inserting the Ndel/Xhol fragment from 
pllisBotF. kan laclq T7lac which contains the BotB gene sequences into similarly cleaved 
pl»A 1 870-2680 kan laclq T7 vector (this vector contains the T7 promoter, the same N- 
terminal his-tag as the Bot constructs, the C. difficile toxin A insert, and the kan laclq genes- 
this cloning replaces the C V difficile toxin A insert with the BotB insert). Proper construction 
was confirmed by restriction digestion. 

Expression of recombinant BotB protein from these expression vectors and purification 
ol the BotB protein is described in F.xample 38 below. 

EXAMPLE 38 

Fermentation And Purification Of Recombinant BotB Protein Utilizing The 
PHisBotB kan laclq T71ac. pHisBotB kan T7lac And pHisBotB kan laclq 7 7 Vectors 

The pHisBotB kan laclq T7lac. pHisBotB kan T7lac and BotB kan laclq T7 constructs 
lall transformed into the BI2HDF.3) s«rain| were grown in fermentation cultures to determine 
the utility of the various constructs for large scale express.on and purification of soluble BotB 
protein. All fermentations were performed as described in F.xample 31. 

a) Fermentation Of pHisBotB kan laclq T7lac/BI21(DE3) Cells 
The fermentation culture was induced 45 min post start of glucose feed with 1 gm 
IPTCJ (final concentration = 0.4 mM). pH was maintained at 6.5 rather than 7.0. The OD, 1(1( , 
was 27 at lime of induction, then 35. 38. and 40 at 1-3 hrs post induction. Duplicate platings 
of diluted 1 hr induction samples (dilutions were prepared as described Fx. 31. dilution 3 
utilized 3 Ml of dilution 2 cells) on I SA and LB Mean plates yielded K<) I SA colonics and 81 
kan colonies (90% kan resistant). 

Total and soluble protein extracts were resolved on a 12.5% SDS-PAGF. gel and total 
protein was detected by staining with Coomassic blue. Low level induction of insoluble Bot 
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B protein was observed, increasing from 1 to 3 hrs post induction (no expression was detected 
in uninduced cells). 

. ■ ■ • ' • i 

l>) Fermentation Of pHisBotB kan T7lac/BI2!(DE3) Cells 

The fermentation culture was induced I hr post start of glucose feed with 2 gm IPTG 
(final concentration = 0.8 mM). pi I was maintained at 6.5 rather than 7.0. The OD W10 was 
24.5 at time of induction, then 3 1 .5. 32.. and 33 at I -3 hrs post induction, respectively.' 
Duplicate platings of diluted 0 hr and 2 hr induction samples (dilutions were prepared as 
described Ex. 31: dilution 3 utilized 3 ul of dilution 2 cells) on I SA and LB+kari plates 
yielded 32 TSA colonies and 54 kan colonies (all kan resistant) tor uninduced cells, and I 
I SA colony and 0 kan colonies 2 hr post induction. These results were indicative of sirong 
induction, since viable counts decreased dramatically 2 hrs post induction. 

Total and soluble extracts were resolved on a 10% SDS-PAGE gel and total protein 
was detected by staining w ith Coomassie blue. Moderate induction of insoluble BotB protein 
■was observed, increasing from I to 3 hrs post induction (no expression was detected in 
. uninduced cells). 

c); Fermentation Of pHisBotB kan laclq T7/BI21(DO) Cells 
The fermentation was induced 2 hr post start of glucose Iced with 4 gm IPTG (final 
concentration = 1.6 mM). pU was maintained at 6.5 rather than 7.0. The OD MK , was 45 at 
time of induction, then 47. 50. and 50 and 55 at 1-4 hrs post induction, respectively. Viable 
colony counts decreased after induction (96. I. I. 2. 3; dilution 3 utilized 3 M l of dilution 2 
cells). Of 63 -colonics scored at the time of induction, all 63 retaining the BotB plasmid (kan 
resistant) and no colonies at induction grew on IPTG + Kan plates (no mutations detected). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and total protein 
was detected by staining with Coomassie blue. Moderate level induction of insoluble BotB 
protein was observed, increasing from I to 4 hrs post induction (lower level expression was 
detected in uninduced cells, since the T7 rather than T7lac promoter was utilized). 

d) Purification Of pHisBotB Protein From pHisBotB amp 
T7lac/BI21(DE3) Cells 

Soluble recombinant BotB protein was purified utilizing NiNTA resin from 80 ml of 
cell lysate generated from cells harvested from a pHisBotB fermentation | using the pHisBotB 
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amp T71ac/BI2I(DE3) strain]. As predicted from the small scale results above, the majority 
of the induced protein was insoluble. As well, the eluted material was contaminated with 
multiple E. co/i contaminant proteins. A Coomassie blue-stained SDS-PAGE gel containing 
extracts derived from pHisBotB amp T7lac/B12I(DE3) cells before and during purification is 
shown in Figure 34. In Figure 34. lane 1 contains broad range protein MW markers 
(BioRad). Lanes 2-5 contain extracts prepared from pHisBotB amp T7Iac/BI21(DH3, cells 
grown in fermentation culture: lane 2 contains total proton: lane 3 contains soluble protein: 
lane 4 contains protein which did not bind to the NiNTA column </.,.. the How-through, and 
lane 5 contains protein eluted from the NiNTA column. 

Similar results were obtained using a small scale IDA column utilizing a cell lysatc 
from the pllisBo.B kan laclq T7 fermentation described above. 250 mis of a 20% w/v PF.I 
clarified lysaie (50 gms ceil pellet) of botB kan laclq T7/BI2KDE3) cells were purified on a 
small scale IDA column. The total yield ol eluted protein was 21 mg protein (assuming I 
mg/ml solution = 2 OD M /ml). When analyzed by SDS-PACiL and Coomassie staining.'the 
BotB protein was found to comprise approximately 50% of the eluted protein with the 
remainder being a ladder of £. „,// proteins similar to that observed with the NiNTA 
purification. 

The NiNTA alkaline phosphatase conjugate was utilized to detect his-tagged proteins 
on a Western blot containing total, soluble, soluble <PFI clarified), soluble (after IDA column) 
and clution samples from the IDA column purification. The results demonstrated that a small 
percentage of BotB protein was soluble, that the soluble protein was not precipitated by PCI 
treatment and was quantitatively bound by the IDA column. Since a I liter fermentation 
harvest yielded a 67.5 gm cell pellet, this indicated that the yield of soluble affinity purified 
BotB protein from the IDA column was 14 mg/litcr. 

EXAMPLE 39 

Co-Expression Of Recombinant BotB Proteins 
And Folding Chapcroncs In Fermentation Cultures 

Fermentations were performed to determine if the simultaneous overexpression of 
folding chapcroncs (i.e.. the tiro operon) and the BotB protein resulted in enhanced solubility 
of the Bot B protein. This example involved fermentation of the plIisBotBkan laclq 
T7lac/ P ACYCGro BL2I(DE3). pHisBotB kan T71ac/ P ACYCGro I312HDE3) and plIisBotBkan 
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laclq T7/ pACYCGro BL2I(DE3) cell lines. Fermentation was carried out as described in 
Example 31: 34 M g/ml chloramphenicol was included in the feeder and fermentation cultures. 

a) Fermentation Of pHisBotBkan laclq T7|ac/pACYCGro 
BL2I(DE3) Cells 

Induction was with 4 gms IPTG at I hr 15 min post initiation of the glucose feed. 
The ()D, 1H , was 38 at time of induction, then 50. 58.5. 62 and 68 at 1-4 hrs post induction. 
Viable colony counts decreased during induction (24. 0. 0. 2. 0 at 0-4 hr induction: dilution 3 
utilized 3 ul of dilution 2 cells), -or 24 colonies scored at the time of induction. 24 retained 
the BotB plasmid (kan resistant). 24 contained the chaperone plasmid (chloramphenicol 
resistant, and no colonies at induction grew on IPTG-. Kan plates (no mutations detected). 

Total and soluble extracts were resolved on 12.5% SDS-PAGE gels and were either 
stained with Coomassie blue or subjected to Western blotting (his-tagged proteins were 
detected utilizing the NiNTA-alkaline phosphatase conjugate). This analysis revealed that the 
Ciro chaperoncs were induced to high levels, but very low level expression of soluble BotB 
protein was observed, increasing from I to 4.0 hrs post induction (no expression detected in 
uninduced cells, induced protein detected only on Western blot). The dramatically lower 
expression of BotB protein in the presence of chaperone may he due to promoter occlusion 
(i.e.. the stronger 17 promoter on the chaperone plasmid was preferentially utilized). 

»>) Fermentation Of pHisBotB kan T7lac/pACYCGro/BI2I(DE3) 
Cells 

Induction was with 4 gms IPTG at 1 hr post initiation of the glucose feed. The OD^ 
was 33.5 at time of induction, then 44. 51. 58.5 and 69 at 1-4 hrs post induction. Viable ^ 
colony counts decreased after 2 hrs induction (43. 65. 74. 0 (70). 0 (70) at 0-4 hr induction: 
bracketed numbers represent microcolonies: dilution 3 utilized 3 ul of dilution 2 cells). Most 
colonies at induction retained the BotB plasmid (kan resistant )and the chaperone plasmid 
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no 
mutations detected). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGK gel and subjected to 
Western blotting: his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperoncs were induced to high levels and 
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low level expression of soluble Bot B protein was observed, increasing from I to 4.0 hrs post 

induction (no expression detected in uninduced cells). 

A small scale IDA purification of BotB protein from a 250 ml PKI clarified 15% w/v 
extract (37.5 gm cell pellet) yielded approximately 12.5 mg proteiu. of which approximated 
50% was BotB protein and 50% was GroEL chaperonc (assessed by Coomassie staining of a 

10% SDS-PAGE gel). The NiNTA alkaline phosphatase conjugate was utilized to detect his- 
lagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble 
(alter IDA column) and clution samples from the IDA column purification. The results 
demonstrated that all of the BotB proiein produced by the pHisBolB kan 
T7lac/ P ACYCG,o/B121(DE3) cells was soluble: the BotB protein was not precipitated bv PEI 
treatment and was -quantitatively bound by the IDA column. Since a 1 liter fermentation 
harvest yielded a 75 gm ceil pellet, this indicated that the yield of soluble affinity purified not 
U protein from this fermentation was 1 2.5 mg/liter. These results also demonstrated that 
additional purification steps are necessary «o separate the chaperonc proteins from the BotB 
protein. 

c) Fermentation Of pHisBotBkan laclq 
T7/pACYCGro/BL2I(DE3) Cells 

Induction was with 4 gms IPTG at 2 hr post initiation of the glucose Iced. The OD„ K , 
was 46 at time of induction, then 56. 63. 69 and 71.5 at 1-4 hrs post induction. Viable colony 
counts decreased after induction (58. 3(5). 3. 0. 0 at 0-4 hr induction: bracketed numbers 
represent microcolonies: dilution 3 utilized 3 ul of dilution 2 cells). All (53/53) colonies 
scored at the time of induction retained the BotB plasmid (kan resistant) and the chaperone 
plasmid (chloramphenicol resistant) and no colonies at induction grew on IPTG + Kan plates 
(no mutations detected). 

Total and soluble extracts were resolved on a 10% SDS-PAGE gels and Western 
blotted and his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase 
conjugate. This analysis revealed that the Gro chaperones were induced to high levels 
(observed by ponceau S staining), and a much higher expression of soluble Bot B protein 
(compared to expression in the pHisBotU kan T7lac/pACYCGm fermentation) was observed 
at all timepoints. including uninduced cells (some increase in BotB protein levels were 
observed after induction). 
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A small scale IDA purification of BotB protein from a 100 ml PEI clarified 15% w/v 
extract (15 gm cell pellet) yielded approximately 40 mg protein, of which approximately 50% 
was BotB protein and 50% was GroEL chaperone. as assessed by Coomassie staining of a 
10% SDS-PAGE gel. The Ni NT A alkaline phosphatase conjugate was utilized to detect his- 
tagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble 
{after IDA column) and elution samples from the IDA column purification. The results 
demonstrated that a significant percentage -10-20 %) of BotB protein was soluble, that 
the snlubilized protein was not precipitated by PEI treatment and was quantitatively bound by 
the IDA column. Since a 10 liter fermentation yielded a 108 gm cell pellet, this indicated that 
the yield of soluble affinity purified BotB protein from this fermentation was 144 mg/liter. 

In a scale up experiment. 2 liters of a 20% w/v PEI clarified lysale of pHisBotB kan 
laclq I 7/pAC YCCiro/BL2 1 ( DE3 ) cells were purified on a large scale IDA column. The 
purification was performed in duplicate. The total yield of BotB protein was 220 and 325 
mgs protein in the two experiments (assuming I mg/mi solution 2.0 OD 28I /ml). This 
represents 0.7% or 1.0%. respectively, of the total soluble cellular protein (assuming a PEI 
l> state having a concentration of 8 mg protein/ml and that the eluted material comprises a 1:1 
mixture of BotB and folding chaperone). The NiNTA alkaline phosphatase conjugate was 
utilized. to detect his-tagged proteins on a Western blot containing total, soluble, soluble (PEI 
clarified*, soluble (after IDA column) and elution samples from the IDA column purification. 
These results demonstrated that a significant percentage (/.c. -10-20 %> of the BotB protein 
was soluble, that the solubilized protein was not precipitated by PEI treatment and was 
quantitatively hound by the IDA column. Since a I liter fermentation harvest yielded a 10K 
gm eel! pellet, this indicated that the yield of soluble affinity purified BotB protein from the 
large scale purification was 60 mg or 89 mg/Iitcr. These results also demonstrated thai 
turthcr purification would be necessary to remove the contaminating chaperone protein. 

The above results provide methodologies for the purification of soluble BotB protein 
from fermentation cultures, in a form contaminated predominantly with a single E. coli 
protein (the folding chaperone utilized to enhance solubility). In the next example, methods 
are provided for the remov al of the contaminating chaperone protein. 
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EXAMPLE 40 

Removal Of Contaminating Folding Chaperone Protein 
From Purified Recombinant t: to// W m Type H Protein . 

In this example size exclusion chromatography and ultrafiltration Was used to purify 
recombman, BotB protein from the folding chaperones and imidazole in IDA-puritled 

material. 

To enhance the solubility of the recombinant BotB protein during scale-up. the protein 
was co-expressed with folding chaperones (see Lx. 39). During the Ni-HM purification step 
•he loldmg chaperones co-eluted with the BotB protein in 800 mM imidazole- therefore a 
■second purification step was required to isolate the BotB free of foldinu chaperones I ane 3 
of F,gure 35 contains proteins e.utcd from an IDA column to which a lysate of pl,i sBo tB kan 
•ado T7/pACYCGro/BL2l,DC3) cells had been applied: the proteins were reso.ved on a 
4-1.% polvacrylamide pre-cast gradient gel (Bio-Rad. Hercules. CA» run under native 
conduions and then stained with Coomassie blue. .„ Figure 35. lanes , and 4 contain protein, 
present in peak 1 and peak 2 from a Sephacry, S-.00 column run as described below: lane 2 
is blank. 

As seen in lane 3 of Figure 35. the IDA-puritied sample consists primarily of the 
loldmg chaperones and the BotB protein. The fact that the chaperones and the Bot B antigen 
appear as two distinct bands under native conditions suggested thev were not complexcd 
together and therefore, it should be possib.e to separate them, using either a gradient of 
imidazole concentrations or size exclusion methods. 

In order to determine whether a gradient of imidazole concentrations could be used to 
separate the chaperone from the BotB protein, a step gradient using imidazole at 200. 400. 
600. and 800 mM in 50 mM sodium phosphate. 0.5 M NaCl and 10 % glycerol. pH 6.8 was 
applied to an IDA column (containing proteins bound from a lysate of pHisBotB kan laclq 
r7/pACYCGro/BL2l(DL:3) cells). By narrowing the range of imidazole concentrations i, 
was hoped that the BotB and chaperone proteins would differentially elule at different 
concentrations of imidazole. Buted proteins were monitored by absorbanee at 280 nm and 
collected either manually or with a fraction collector (BioRad). Protein was found to elute at 
200 and 400 mM imidazole only. 

Figure 36 shows a Coomassie stained SDS-PAGF gel containing protein eluted durin- 
the ,m,dazole step gradient. I.ane I contains broad range MW markers (BioRad). Lane 2 
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contains BotB protein purified by IDA chromatography of an extract of pHisBotB/BL21(DE3) 
pLysS cells grown in shaker flask culture (^.. no co-expression of chaperones: Ex^35). 
Lane 3 contains a 20% w/v PEI clarified lysate of pHLsBotB kan iaclq 
T7/ P ACYCGro/BL2HDE3) cells (,.c. the lysate prior to purification by IDA 
chromatography). Lanes 4 and 5 contain protein which eluted at 200 or 400 mM imidazole 
respectively. Lane 6 is blank. Lanes 7 and 8 contain 1/5 the load present in lanes 4 and 5, 

As shown in Figure 36. both the chaperone and the BotB protein eluicd in 200 mM 
imidazole, and more chaperone elutes in 400 mM imidazole, however no concentration of 
imidazole tested permitted the elution of BotB protein alone. Consequently, no significant 
purification was achieved using imidazole at these concentrations. 

Because of the considerable difference in molecular weights between the folding 
chaperone. which is a multimer with a total molecular weight around 400 kD (as determined 
on a Shodex KB 804 sizing column by HPLC). and the recombinant BotB protein (molccu.ar 
ue.gh, around 50 kD). size exclusion chromatography was next examined for the ability ,„ 
separate these proteins. 

a) Size Exclusion Chromatography 

A column containing Sephacryl S-100 HR (S-100) (Pharmacia) was poured (2.5 cm x 
24 cm : 110 ml bed volume). The column was equilibrated in a buffer consisting of 
phosphate buffered saline (lOmM potassium phosphate. 150 mM NaCI. pM 7.2) and 10 % 
glycerol (Mallinkrodt). Typically. 5 ml of the IDA-purified BotB protein was filtered through 
a 0.45 At svrinyc r , Uer an(j app|jc , d u> (he coJumn ^ cqui|jbrai . on buffer pumped 

through the column at a How rate of I ml/minute. Eluted proteins were monitored bv 
ubsorbance at 280 nm and collected either manually or with a fraction collector. Appropriate 
tubes were pooled, if necessary, and the protein was quantitatcd by absorbance at 280 nm 
and/or by BCA protein assay. The isolated peaks were then analyzed by native and/or 
SDS-PAGE to identify the protein and evaluate the purity. 

Because of its larger size, the folding chaperone eluted first, followed bv the 
recombinant BotB protein. A smaller third peak was observed which failed to stain when 
analyzed by SDS-PAGE and therefore was presumed to be imidazole. 

SDS-PAGE analysis ( 1 2.5% polyacrylamide. reduced samples) was used to evaluate 
me purity of the IDA-purified recombinant BotB protein before and after S- 100 purification. 
The results are shown in Figure 33. 
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In Figure 33. lane I contains broad range M W markers ( BioRad). Lane 5 contains 
IDA-purified BotB protein. Lane 6 contains IDA-puritied BotB protein following S-100 
purification, l ane 7 is blank (lanes 2-4 were discussed in Fx. 34 above). 

The results shown in Figure 33 show that the IDA-purified BotB is significantly 
contaminated with the folding chaperone (molecular weight about 60 kl) under reducing 
conditions: lane 6). Following S-100 purification, the amount of folding chaperone present in 
the BotB sample was reduced dramatically (lane 7). Visual inspection of the Coomassie 
-stained SDS-PAGri gel revealed that after S-100 purification. > 9()o /o of the , ola| protein 
present was BotB. 

The IDA-purified BotB and the S- 1 00-purified BotB samples were analyzed by HPLC 
on a size exclusion column (Shodex KB 804): this analysis revealed that the BotB protein 
represented 64% of the total protein in the IDA-purified sample and that following S-100 
purification, the BotB protein represented >95% of the total protein in the sampled 

The IDA-purified BotB material was also applied to a AC A 44 (Speetral'or. Houston. 
TX) column. The ACA 44 resin is equivalent to the S-100 resin and chromatography using 
the ACA 44 resin was carried out exactly as described above lor the S-100 resin. The ACA 
44 resin was found to separate the recombinant BotB protein from the folding chaperone; 
The ACA 44-purifled BotB sample was analyzed for endotoxin using the LAI. nssay 
(Associates of Cape Cod) as describe in Kxample 24. Two aliqouts of the ACA 44-purillcd 
BotB preparation were analyzed and were found to contain either 58 to I 16 Lli/mg 
recombinant protein or 94 to 1 89 EU/mg recombinant protein. 

These results demonstrate that size exclusion chromatography can be used to purify the 
recombinant BotB protein from the folding chaperone and imidazole in IDA-purified material. 

b) Ultrafiltration For The Separation Of Recombinant BotB 
Protein And Chapcrones 

Ultrafiltration was examined as an alternative method for the separation recombinant 
BotB protein and folding chanerones in IDA-purified material. While in this example only 
mixtures of BotB and chapcrones were separated by ultrafiltration, this technique is suitable 
for use with recombinant BolA and BotH proteins as well provided that the wash buffers used 
are altered as necessary to take into account different requirements tor solubility. 

The recombinant BotB protein and folding chapcrones were separated using a two-step 
sequential ultrafiltration method. The first membrane used had a- nominal molecular weight 
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cutoff (MWCO) of approximately 100 kD: this membrane retains the larger folding chaperone 
while allowing the smaller recombinant protein to pass through. The addition of several 
, volumes of wash buffer may be required to efficiently wash the recombinant protein through 
the membrane. The second step utilized a membrane with a nominal' MWCO of 
approximately 10 kD. During this step, the recombinant antigen was retained by the 
membrane and could be concentrated to the degree desired and the imidazole and excess wash 
buffer passed through the membrane. 

Twenty-seven milliliters of an IDA-purified BolB preparation was ultrafiltercd through 
a 47 mm YM 100 (100 kD MWCO) membrane (Amicon) in a 50 ml stirred cell (Amicon). 
The membrane was washed in dd H : 0 prior to use as recommended by the manufacturer. Six 
volumes of 10% glycerol in PBS were washed through to remove most of the recombinant 
BotB protein and this wash was collected in a separate vessel. The resulting BotB 
proteiivrich nitrate was then concentrated 12-fold using a YM 10 ( 10 kD MWCO) membrane 
(Amicon). to a final volume of 14 ml. The YM 100 and YM 10 concentrates were analyzed 
along with the lysate starting material by native PAGE using a 4 -15% pre-cast gradient gel 
(BioRad). The : results are shown in Figure 37. 

In Figure 37. lane I contains IDA-purified BotB derived from a shaker flask culture 
(/.c. no co-expression of chaperones: lix. 35): lane 2 contains a 20% w/v PF.l clarified lysate 
of pHisBotB kan lack, T7/pACYCGro/BL2UDE3) cells: lane 3 shows the lysate of lane 3 
after IDA purification: lane 4 coniains the YM 10 concentrate and lane 5 coniains the YM 
100 concentrate. 

The results shown in Figure 37 demonstrate that the recombinant BolB protein can be 
purified away from the folding chaperone by ultrafiltration through a 100 kD MWCO 
membrane (lane 4). leaving the chaperone protein in the 100 kD concentrate (lane 5). 
Analysis of the sample in lane 5 also showed that very little of the BolB protein was retained 
by ihc 100 kD MWCO membrane after 6 volumes of wash buffer had been applied! 

The BotB samples following IDA chromatography and following ultrafiltration through 
ihc YM 100 membrane were anlyzcd by HPLC on a size exclusion column (Shodex KB 804): 
tins analysis revealed that the BotB protein represented 64% of ihc total protein in the IDA- 
purifled sample and that following ultrafiltration through the YM 100 membrane, the BotB 
protein represented >96% of the total protein in the sample. 

The BotB protein purified by ultrafiltration through the YM 100 membrane was 
examined lor endotoxin usmg the LAL assay (Associates of Cape Cod) as describe in 
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Example 24. Two aliqouts of the YM 100-purified BotB preparation were analv Z ed and were 
found to contain either 18 to 36 EU/mg recombinant protein or 1 25 to 250 EU/mg 
recombinant protein. t 

The above results demonstrate that size exclusion chromatography and ultrafiltration 
can be used to purify recombinant botulinal toxin proteins away from folding chaperones. 

EXAMPLE 41 

Cloning And Expression Of The C Fragment 
Of The C. hoiulinum Serotype E Toxin Gene 

The C. hoiulinum type l£ neurotoxin gene has been cloned and sequenced from several 
different strains [Poulet el al. (1992) Biochem, Biophvs. Res. Common. 183:107 (strain 
Beluga): Whelan el al. ( 1992) Eur. J. Biochem. 204:657 (strain NCTC 1 1219): Fujii *, „/. 
(1990, Microbiol. Immunol. .14:1041 (partial sequence of strains Mashike. Ivvani and Otaru) 
und Fujii „/ ; (1993) J. Gen. Microbiol. 139:79 (strain Mashike,). The nucleotide sequence 
«f «he type. I£ toxin gene is available from the EMBL sequence data bank under accession 
numbers X62089 (strain Beluga, and X62683 (strain NCTC I 1 219). The nucleotide sequence 
of the coding region (strain Beluga) is listed in SEQ ID NO:49. The amino acid sequence of 
the C hoiulinum type i: neurotoxin derived from strain Belgua is listed in Sl£Q ID NO:50. 
The nucleotide sequence of the coding region (strain NCTC 1 1 2 1 9, is listed in SEQ ID 
NO:5 1 . The amino acid sequence of the C \ hoiulinum type E neurotoxin derived from strain 
NCTC I 1219 is listed in SEQ ID NO:52. 

The UNA sequence encoding the native C. hoiulinum serotype K C fragment gene 
derived from the Beluga strain can be expressed as a histidinc-tagged protein using the 
pETHisb vector: the resulting coding region is listed in SEQ ID NO:53 and the corresponding 
amino acid sequence is listed in SEQ ID NO:54. The DNA sequence encoding the C 
fragment of the native C hoiulinum serotype E gene derived from the NCTC 11219 strain 
can be expressed as a histidine-tagged fusion protein using the pETHish vector: the resulting 
coding region is listed in SEQ ID NO:55 and the corresponding amino acid sequence is listed 
in SEQ ID NO:56. The C fragment region from any strain ol C hmulinum serotype If can be 
amplified and expressed using the approach illustrated below using the C fragment derived 
from ( '. hoiulinum type E 2231strain (A'I CC #17786). 



- 202 - 



WO 98/08540 PCT/US97/153M 

The. type E neurotoxin gene is synthesized as a single polypeptide chain which may be 
converted to a double-chain form </.ir,,,a heavy chain and a light chain) by cleavage with 
trypsin: unlike the type A neurotoxin, the type E neurotoxin exists essentially only in the 
single-chain form. The 50 kD carboxy-terminal portion of the heavy chain is referred to as 

' 5 the C f ragment or the H,. domain. Expression of the C fragment of C. boiulinum type E toxin 
in heterologous hosts (e.g.. E. coin has not been previously reported. 

The native C fragment of the C. hoiulinum serotype E toxin (BotE) gene was cloned 
and inserted into expression vectors to facilitate expression of the recombinant BotE protein in 
L. cali. This example involved PCR amplification of the gene, cloning, and construction of 

10 expression vectors. 

The BotE serotype gene was isolated using PCR as described for the BotA serotype 
gene in Example 28. The CV boiulinum type E strain was obtained from the American Type 
C ulture C ollection < ATCC » 1 7786: strum 223 1 ). The following primer pair was used in the 
PCR amplification: 5 -CGCC\VrGGCTCTTTCTTCTTAT ACAGATGAT-3" (5" primer. 
15 engineered \co\ site underlined) (SEQ ID NO:57) and 

5 -C JC ■AjACiCTT7T.-lTTTT7'CT'l"GCC.'AT CC ATG-3 (3" primer, engineered //mdlll site 
underlined, native gene termination codon italicized) (SEQ II) NO:58). The PCR product was 
inserted into pCRscript as described in Example 28. The resulting pCRscript BotE clone was 
confirmed by restriction digestion, as well as. by obtaining the sequence of approximately 300 
bases located at the 5" end of the C fragment coding region using standard DNA sequencing 
methods. The resulting BotE sequence was identical to that of the published C hoiulinum 
type E toxin sequence fWhelan ci al ( 1992). supra]. 

The AV*l(fillcd)////«dIII fragment from a pCRscript BotE recombinant was cloned into 
pETHisb vector as described for BotA C fragment in Example 28. The resulting construct 
was termed pHisBotE. pHisBotE expresses the BotE gene under the control of the T7 lac- 
promoter and the resulting protein contains an N-lcrminal lOXHis-tag affinity tag. 

The pHisBotE expression construct was transformed into BL2HDE3) pLysS competent 
cells and I liter cultures were grown, induced and his-iagged proteins were purified utilizing a 
NiNTA resin (elutcd in low pH elution buffer) as described in Example 28. Total, soluble 
and purified proteins were resolved by SDS-PAGE and detected by Coomassie staining. The 
results are shown in Figure 38. 

In Figure 38. lane 1 contains broad range MW markers (BioRad): lane 2 contains a 
total protein extract: lane 3 contains a soluble protein extract: lane 4 contains proteins present 
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in the How through from the NiNTA. column (this sample was not diluted prior to loading and 
therefore represents a load 5X that of the load applied lor the total and soluble extracts in 
lanes 2 and 3): lane 5 contains proteins eluted from the NiNTA column: lane 6 contains 
protein eluted from a NiNTA column which had been stored at -20°C for I year; 

The pllisBotE protein was expressed at moderate levels (7 mg/liter) as a totally soluble 
protein. The purified protein migrated as a single band of the predicted MW. 

Western blot hybridization utilizing a chicken ami-C hondinum serotype I- toxoid 
primary antibody (generated by immunization of hens as described in Example .1 using C. 
hondinum serotype E toxoid ) was also performed on the total, soluble and purified Bolt 
proteins Samples of BotA and BotB C fragments were also included on the gels to facilitate 
MW and immunogenicity comparisons. Strong immunoreaetivity was detected using the 
anti-C hoiulinum type 1- toxoid antibody only with the liotE protein. 

These results demonstrate that the native BbtE gene sequences can be expressed as a 
soluble his-iagged protein in E. co/i and purified by metal-chelation affinity chromatography: 

EXAMPLE 42 

Generation Of Neutralizing Antibodies Using The Recombinant pi lisBotE Protein 

The ability of the purified pHisBotE protein u, generate neutralizing antibodies was 
examined. Nine BAl.Bc mice were immunized with BotE protein (purified as described in 
Lx. 4| , using Cierbu GMDP adjuvant (CC Biotech). The low P H eluiion was mixed with 
Gerbu adjuvant and used to unmunize mice. Each mouse received a subcutaneous injection 
of 100 M | antigen/adjuvant mix (35 ug antigen + I ug adjuvant) on day 0. Mice were 
subcutaneously boosted as above on day 14 and bled on day 28. Mice were subsequently 
boosted and bled on day 70. 

Anti-C hoiulinum serotype E toxoid titers were determined in day 28 serum from 
individual mice from each group using the EUSA protocol outlined in Example 29 with the 
exception that the plates were coated with C. hondinum serotype li toxoid, and the primary 
antibody was a chicken ami-C hmuHnum serotype E toxoid. Seroconversion |relalive to 
control mice .mmunized with the pfcxIIisBotA antigen (Ex. 29)] was observed with all 9 mice 
immunized with the purified pHisBotE protein. 

The ability of the anti-Bolt antibodies to neutralize native ( hondinum type E toxin 
was tested in a mouse-C hoiulinum neutralization model using pooled mouse serum (sec Ex. 
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23b). The LD 5U of purified C. botulinum type E toxin complex (Dr. Eric Johnson. University 
of Wisconsin. Madison) was determined by a intraperitoneal (IP) method [Schantz and Kautler 
(1978). supraj using 18.22 g temale ICR mice. The amount of neutralizing antibodies present 
in the serum of the immunized mice was determined using serum antibody titrations. The 
various serum dilutions (0.01 ml) were mixed with 5 LD 5( , units of C. hotulinum type E toxin 
and the mixtures were injected IP into mice. The neutralizations were performed in duplicate. 
The mice were then observed for signs of botulism for 4 days. Undiluted scrum from day 28 
did not protect, while undiluted. 1/10 diluted and 1/100 diluted day 70 serum protected ( 1005 
of animals) while 1/1000 diluted day 70 serum did not. This corresponds to a neutralization 
liter of 50-500 lU/ml. 

These results demonstrate that seroconversion occurred and neutralizing antibodies 
were induced when the recombinant Both protein was utilized as the immunogen. 

EXAMPLE 43 

Construction Of Vectors To Facilitate Expression 
Of Mis-Tagged BotE Protein in Fermentation C ultures 

A number of expression vectors were constructed to facilitate the expression of 
recombinant BotE protein in large scale fermentation culture. These constructs varied as to 
the strength of the promoter utilized (T7 or T7lac) and the presence of repressor elements 
(lack,) on the plasmid. The resulting constructs varied in the level of expression achieved and 
in plasmid stability which facilitated the selection of a optimal expression system for 
fermentation scaleup. This example involved a) construction of BotE expression vectors and 
b) determination of expression levels in small scale cultures. 

a) Construction Of BotE Expression Vectors 

The BotE expression vectors created for fermentation culture were engineered to 
utilize the kanamycin rather than the ampicillin resistance gene, and contained cither the T7 or 
T7lac promoter, with or without the laclq gene for the reasons outlined in Example 30. 

In all cases, the protein expressed by the various expression vectors is the pHisBolE 
protein described in Example 4 1 . with the only differences between clones being the alteration 
of various regulatory elements. Using the designations outlined below, the pUisBotE clone 
(Ex. 41 ) is equivalent to pHisBolE amp T7lac. 



- 205 - 



WO98/WS40 i PCT/US97/1S394 

■ ' " 

i) Construction Of pHisBotE kan laclq T7Iac 

pHisBotE kan laclq T7lac was constructed by inserting the Xbal/Mindlll fragment of 
pH.sBotE which contains ,hc BotE gene sequences into A^I/^dlU-cieaved P nT24a vector 
Proper construction was confirmed by restriction digestion. 

•i> Construction Of pi lisBotE kan T7 

pHisBotE kan T7 was constructed by ligating the BotE-containing XhaUSafl fragment 
of pHisBotE kan Jaci q T7lac to the T7 promoter-conta.ning Xhal/Sapl fragment of pET23a 
Proper construction was confirmed by restriction digestion. 

in) Construction Of pHisBotE kan lac!qT7 

pHisBotK kan laclqT7 was constructed by inserting the Z*/l I /Hindi II lra gmen, trom 
pH.sBotE kan T7 winch contains the Both gene sequences into ^/II////„dIII-cleaved P ET24 
vector. Proper construction was confirmed by restriction digestion. 

H) Determination Of BotE Expression Levels In Small Scale 
Cultures 

The three BotE kan expression vectors described above were transformed into 
BI2.(DE3) competent cells and 50 ml (2XYT + 40 M g/ n ,l kan) cultures were urown and 
mduced with ITPG as described in Example 28. Total and soluble protein extracts from 
before and after induction made as described in Example 28. The total and soluble extracts 
were resolved on a 1 2.5% SDS-PAGE gel. and his-tagged proteins were de.ected on a 
Western blot utilizing the NiNTA-alkaline phosphatase conjugate as described m Example 
31(c)(iii»-. The results showed that all three BotF. cell lines expressed his-tauged proteins of 
the predicted MW for the BotE protein upon induction. The results a.so demonstrated that 
the two constructs that contained the T7 promoter expressed the BotF. protein before 
induction, while the T7lac promoter construe, did not. Upon induction, the T7 promoter- 
containing constructs induced to higher levels than the T7lac-containing construct, with the 
pHisBotE kan lacIc,T7/B12I(DE3) cells accumulating the maximal levels of BotF protein 
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EXAMPLE 44 

Expression And Purification Of pHisBotE From Fermentation Cultures 

Based on the small scale inductions performed in Example 43. the pHisBotE kan laclq 
T7/BI2KDE3) strain was selected for fermentation scaleup. this example involved the 
fermentation and purification of recombinant BbtE C fragment protein. 

A fermentation with the pHisBotE kan laclq T7/BI2I(DE3) strain was performed as 
described in Example 31. The fermentation culture was induced 2 hrs post start of the 
glucose feed with 4 ym IPTG (final concentration 1.6 mM). The OD u „ was 42 at time of 
induction, then 46.5. 48. 53 and 54 at 1-4 hrs post induction. Viable colony counts decreased 
from 0-4 hr induction 1131. 4 (28). 7 (3), 7. 8; dilution 3 utilized 6 ul or dilution 2 cells: 
bracketed colonies are microcolonies). All (32/32) colonies scored at the time of induction 
retained the BotE plasmid (kan resistant) and no colonies at induction grew on IPTG+Kan 
pla.es (no mutations detected). These results were indicative of strong promoter induction, 
since colony viability reduced after induction, and the culture stopped growing during 
fermentation (stopped at 54 OD^/ml). 

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and total protein 
was detected by staining with Coomassie blue. The results are shown in Figure 39. 

In Figure 39. lane 1 contains total protein from a pHisBotA kan T7 lac/BI2 l(DE3) 
pLysS fermentation (Ex. 24). Lanes 2-9 contain extracts prepared from the above pHisBotE 
kan laclq T7/BI2I(DE3) fermentation: lanes 2- 4 contain total protein extracts prepared at 0. I 
and 2 hours post-induction, respectively. Lane 5 contains a soluble protein extract prepared at 
2 hours post-induction. Lanes 6 and 7 contain total and soluble extracts prepared at 3 hours 
post-induction, respectively. Lanes 8 and 9 contain total and soluble extracts prepared at 4 
hours post-induction, respectively. Lane 10 contains broad range MW markers (BioRad). 

The results shown in Figure 39 demonstrate that moderate level induction of totally 
soluble Bot E protein was observed, increasing from I to 4 hrs post induction (no expression 
was detected in uninduccd cells). From a 2 liter fermentation harvest a 155 gm (wet wt) cell 
pellet was obtained and used to make a PEI-clarified lysate (I liter in CRB. p|| 6.8). The 
lysate was applied to a large scale IDA column and 200 mg of BotE protein, which was 
found to be greater than 95% pure (as .judged by visual inspection of a C oomassie stained 
SDS-PAGE gel), was recovered. This represents 2.5% of the total soluble cellular protein 
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(assuming a PEI lysate having a concentration of 8 mg protein/ml) and corresponds to a vield 
ol 100 mg BotE protein/liter of fermentation culture. 

The above results demonstrate that high levels of the recombinant BotE protein can be 
expressed and purified from fermentation cultures. 

EXAMPLE 45 

Removal Of Imidazole From Purified Recombinant BotE Protein Preparations 

The expression of recombinant BotE protein, unlike the BoiA and BotB proteins did 
mu require the presence of folding chaperones to maintain solubility durinu scale-up A size 
exclus.on chromatography step was included however to remove the imidazole from the 
sample and exchange the IDA elution buffer for one consistent with the BoiA antiuen 

A Sephacryl S-100 HR (S-100: Pharmacia, column was poured (2.5 cm x 24 cm: bed 
volume - 1 10 ml). Under these conditions, the BotE protein should be retained bv the beads 
.o a lesser degree than the smaller imidazole, therefore the BotE protein should elute from the 
column before the imidazole. The column was equi.ibrated in a buffer consistinu of SO mM 
sod,um phosphate. 0.5 M NoCI. and 10% glycerol (all reagents from Mallinkrodn. l ive 
milliliters of the IDA-purified BotE protein (Ex. 44) was filtered through a 0.45 M svrinue 
filter and applied to the S-100 column, and equilibration buffer was pumped through the 
column a, a How rate of I ml/minute. Eluted proteins were monitored bv absorbance at ?80 
nm. and collected either manually or with a fraction collector. Appropriate tubes were pooled 
.1 necessary, and the protein was quan.itated by absorbance at 280 nm and/or IK A protein 
assay. The isolated peaks were then analyzed by native and/or SDS-PAUE to identify the 
proteints) and evaluate the purity. 

Figure 40 provides a representative chromatogram showing the purification of 
IDA-purified BotE on the S-100 column. Even though folding chaperones were not 
over-expressed with this antigen, a small amount of protein eluled at a time consistent with 
the folding chaperones expressed with BotA and BotB proteins «Jro> (see the first peak), The 
second peak in the chromatogram contained the BotE protein, and the third peak was 
presumably imidazole. This presumed imidazole peak was isolated in comparable levels in 
IDA-purified BotA and BotB protein preparations as well. 
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These results demonstrate that size exclusion chromatography can be used to remove 
imidazole and traces of contaminating high molecular weight proteins from IDA-purificd 
BotF protein preparations. 1 

The S- 1 00-piiri tied BotE protein was tested lor endotoxin contamination using the 
I.AI. assay as described in Example 24. This preparation was found to contain 64 to 128 
1-1 J/mg recombinant protein and is therefore substantially free of endotoxin. 

The S- 1 00 purified BotE was mixed with purified preparations of BotA and BotB 
proteins and used to immunize mice: 5 ug of each Bot protein was used per immunization 
and alum was included as an adjuvant. After two immunizations with this trivalent vaccine, 
the immunized mice were challangcd with C. ho,uli,n,m toxin. The immunized mice 
contained neutralizing antibodies sufficient to neutralize between 100.000 to TO0O.00O l.D 
of either toxin A or toxin B and between 1.000 to 10.000 I.b« of toxin L*. The titer 
neutralizing antibodies directed against toxin E would be expected to increase following 
subsequent boosts with the vaccine. These results demonstrate that a trivalent vaccine 
containing recombinant BotA. BotB and BotE proteins provokes neutralizing antibodies. 

EXAMPLE 46 

Expression Of The C Fragment Of The ( hoiulinum 
Serotype C Toxin Gene And Generation Of Neutralizing Antibodies 

The C hoiulinum type C I neurotoxin gene has been cloned and sequenced fKimura c/ 
<//, (I W0) Biochcm. Biophys. Res. Comm. 171 1 304 J. The nucleotide sequence of the toxin 
gene derived from the C. hoiulinum type C strain C-Slockholm is available from the 
EMBI./GcnBank sequence data banks under the accession number D90210: the nucleotide 
sequence of the coding region is listed in SEQ ID NO:59. The amino acid sequence of the C. 
hoiulinum type CI neurotoxin derived from this strain is listed in SEQ II) NO:60. 

The DNA sequence encoding the native C. hoiulinum serotype CI C fragment gene 
derived from the C-Stockholm strain can be expressed using the pETIIisb vector: the resulting 
coding region is listed in SEQ ID NO:6l and the corresponding amino acid sequence is listed 
m SEQ ID NO:62. The C fragment region from any strain of C. hoiulinum serotype C can be 
amplified and expressed using the approach illustrated below using the C fragment derived 
from ( hoiulinum type C C-Stockholm strain. Expression of the C fragment of C hoiulinum 
type CI toxin in heterologous hosts U '.». E. coli) has not been previously reported. 
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The C fragment of the C hoiuiinum serotype CM (BotCl ) toxin gene is cloned using 
the protocols and conditions described in Example 28 for the isolation of the native BotA 
gene. A number of C. hotulinum serotype C strains (expressing either or both CI and C2 - 
toxin) arc available from the ATCC -[e.g.. 2220 (ATCC 17782). 2239 (ATCC 17783). 2223 
(ATCC 17784; a type C-0 strain: C-fJ strains produce C2 toxin). 662 ( ATCC 1 7849: a type 
C-u strain: C-a strains produce mainly CI toxin and a small amount of C2 toxin). 2021 
(ATCC 17850: a type C-a strain, and VPI 3803 (ATCC 25766,|. Alternatively, other tvpe C 
•strains may be employed for the isolation of sequences encoding the C fragment of C 
hoiuiinum serotype C toxin. 

The following primer pair is used to amplify the BotC gene: 5-CG CCATGG C 
TTTATTAAAAGATATAATTAATG-3- f5' primer, engineered AV„I site underlined (SEQ ID 
NO:63)I and 5"-GCAAGCTT7T/lTTCACTTACAGGTAC AAAACC-l" [3" primer, 
engineered HindUl site underlined, native gene termination codon italicized (SEQ ID 
NO:64)|. Following J'CR amplification, the PCR product is inserted into the pCRscript vector 
and then the 1.5 kb fragment is cloned into pHTllisb vector as described for BotA C fragment 
gene in 1 Sample 28. The resulting construct is termed pHisBotC. Proper construction is 
confirmed by UNA sequencing of the BotC sequences contained within pl lisBotC. 

pHisBotC expresses the BotC gene sequences under the transcriptional control of the 
T7 lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
PHisBotC expression construct is transformed into B1.2KDE3) pl.ysS competent cells and I 
liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
(elutcd in 250 mM imidazole. 20% glycerol) as described in .Example 28. Total, soluble and 
purified proteins are resolved by SDS-PAGE and detected by Coomassie staining and Western 
blot hybridization utilizing a Ni-N TA-alkaline phosphatase conjugate (Qiagen) which 
recognizes his-tagged proteins as described in Example 31(c)(iii). This analysis permits the 
determination of expression levels of the pHisBotC protein {i.e., number of mg/liter expressed 
as a soluble protein,. The purified BotC protein will migrate as a single band of the predicted 
MW {i.e.. -50kD). 

The level of expression of the pHisBotC protein may be modified (increased) by 
substitution of the 17 promoter lor the T7lac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL2KDE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (/>.. less than 0.5%) of soluble 
PHisBotC protein are expressed using the above expression systems, the pHisBotC construct 
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may be co-expressed with pACYCGro construct as described in Example 32. In this case, the 
recombinant BotC protein may co-purify with the folding chaperoncs. The contaminating 
chaperones may be removed as described in Example 34. Preparations of purified pHisBotC 
protein arc tested for endotoxin contamination using the l.AL assay as described in Example 
24. 

The purifed plIisBbtC protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotC protein using Gerbu GMDP adjuvant (CC Biotech) as 
described in Example 36. The ability of the anti-BotC antibodies to neutralize native C. 
hoiulinum type C toxin is demonstrated using the mouse-C. hoiulinum neutralization model 
described in Example 36. 

EXAMPLE 47 

Expression Of The C Fragment Of The C hoiulinum 
Serotype D Toxin Gene And Generation Of Neutralizing Antibodies 

The C. hoiulinum type D neurotoxin gene has been cloned and sequenced (Sunagawa 
et al. ( 1992) J. Vet. Med. Sci. 54:905 and Binz a at. (1990) Nucleic Acids Res. 18:5556]. 
The nucleotide sequence of the toxin gene derived from the CBI6 strain is available from the 
EMBL/GenBank sequence data banks under the accession number S49407: the nucleotide 
sequence of the coding region is listed in SEQ ID NO:65. The amino acid sequence of the C. 
hoiulinum type D neurotoxin derived from the CB16 strain is listed in SEQ ID NO:66. 

I'lie DNA sequence encoding the native C. hoiulinum serotype D C fragment gene 
derived from a BotD expressing strain can be expressed using the pETIIisb vector: the 
resulting coding region is listed in SEQ ID NO:67 and the corresponding amino acid sequence 
is listed in SEQ ID NO:68. The C fragment region from any strain of (.'. hoiulinum serotype 
D can be amplified and expressed using the approach illustrated below using the C fragment 
derived from C. hoiulinum lype D CBI6 strain. Expression of the C fragment of C 
hoiulinum type D toxin in heterologous hosts (eg.. E. cnli) has not been previously reported. 

I lie C: fragment of the C. hoiulinum serotype D (BotD) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
A number of C hoiulinum type D strains are available f rom the ATCC \v.$i„ ATCC 9633. 
2023 (ATCC 17851). and VPI 5995 (ATCC 27517)]. 
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The following primer pair is used to amplify the BotD gene: 5 ? -CGCCATGGC 
T^TTAAAAOATATAATTAATG-3- { y primer, engineered site undcr^SEO fD 
I and 5--GCA^n,CTCTACCC^^ primer, engineered 

"mHIII sue undcrhned. native gene termination codon italicized (SEQ ID NO- 6 0)l 
> ^.owing P CR ampJ i fiC ation. the PCR product Is inserted into the pC Rscript vector and ^ 

F,al 1 , T m " , Ck>nCd PE ™ iSb VCCl ° r " dCSCr,bed B ° ,A C — * 
example 28. The resulting construct is termed pHisBotD. 

PHisBotD expresses the BotD gene sequences under the transcriptional control of the 

.0 I " P r 0,Cr ^ rCSU,ting Pr0tei " C ° mainS a " N — «0XHis-,ag affinitv la , The 
pHisBotD expression construct is transformed into BL2I(DE3) P L ysS competent ceUs and , 
Lter cuhures arc grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as desenbed in Example 28. Total, soluble and purified proteins are resold bv SDS-PAGE 
and detected bv Coomassie staining and Western blot hybridization utilizing a Ni-NTA- 
alkahne phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as describe in 
»- Example ..(cXiii, This analysis permits the determination of expression ieve.s of the 

pH.sBotD protein (,.*.. number of mg/.i,er expressed as a soluble protein,. The purified BotD 
protein w,l| migrate as a single band of the predicted MW (/.c . .50kD). 

fhe level of expression of the pHisBotD protein may be modified (increased) bv 
substitution of the T7 promoter for the T7,ac promote, or by inclusion of ,he iaclo. gene on 
the expression plasmid. and p«asmid expressed in Bl.2 1(DE3) cell lines i„ fermentation 
cultures as described in Example 30. If on.v very ,ow levels (, t , less than about 0. 5% > of 
so.uble pHisBotD protein are expressed using the above expression svstems. the pHisBotD 
construe, may be co-expressed with pACYCGro construe, as described ,n Example 3- In 
^ th.s case, the recombinant BotD protein may co-purify with the folding chaperones The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purified pHisBotD protein are tested for endotoxin comam.nation using the LAL assav as 
described in Example 24. ' 

The purifed pHisBotD protein is used to generate neutralizing antibodies BALBc 
m.ce are .mmunized with the BotD protein using Cierbu GMDP adjuvant (CC Biotech) as 
» desenbed in Example 36. The ability of the anti-BotD antibodies to neutra.ize native C 

yuan,*, type D toxin is demonstrated using the mouse-C W,„,„„ neu.rafization mcKle. 
described in Example 36. 
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EXAMPLE 48 

Expression Of The C Fragment Of The C. holidinum 
Serotype F Toxin Gene And Generation Of Neutralizing. Antibodies 

3 The C. hotulinum type F neurotoxin gene has been cloned and sequenced [East ei al, 

( I 992) KEMS Microbiol. Lett. 96:2251. The nucleotide sequence or the toxin gene derived 
from the 202F strain (ATCC 23387) is available from the EMBL/GenBank sequence data 
banks under the accession number M92906: the nucleotide sequence of the coding region is 
listed in SEQ ID NO: 70. The amino acid sequence of the C hotulinum type F neurotoxin 
10 derived from the 202F strain is listed in SEQ ID NO:71. 

The DN A sequence encoding the native C. hotulinum serotype I- C fragment gene 
derived Iron, the 202F strain can be expressed using the pETHisb vector: the resulting coding 
region is listed in SEQ ID NO:72 and the corresponding amino acid sequence is listed in SEQ 
ID N(>:73. The C fragment region from any strain of C. hoiulimnn serotype F can be 
amplilied and expressed using the approach illustrated below using the C fragment derived 
from (' hotulinum type F 2021- strain. Expression of the C fragment of C. handinum type F 
toxin in heterologous hosts (<.<#.. E. «tli) has not been previously reported. 

The C fragment of the C. hotulinum serotype I- (BotF) toxin gene is cloned using the 
protocols and conditions described in Example 28 for the isolation of the native BotA gene. 
The < hotulinum type F 202F strain is obtained from the American Type C ulture C ollection 
(ATCC 23387). Alternatively, sequences encoding the BotF toxin may be isolated from any 
Moll- expressing strain VPI 4404 (ATCC 25764). VPI 2382 (ATCC 27321) and 

Lungeiand (ATCC 35415)). 

I he following primer pair is used to amplify the BotF gene: 5 -CG CCA TGG C 
TA TTCTAA I TA1 ATATTTTAA I'AG-3* [5* primer, engineered Xcol site underlined (SEQ 
ID NO:74)| and 5--GCA^G£TT7-C^TTCTTTCCATCCATTCTC-3- \3' primer, engineered 
HmdUl site underlined, native gene termination codon italicized (SEQ ID NO:75)]. 
Following I'CR amplification, the PCR product is inserted into the pCRscript vector and then 
the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in 
Example 28. The resulting construct is termed pHisBolF. 

pMisBotF expresses the BotF gene sequences under the transcriptional control of the 
17 lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
PHisBolF expression construct is transformed into BL21(DK3) pLysS competent cells and 1 
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liter cultures arc grown, induced and his-tagged proteins are purified utilizing a NiNTA rcsin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassic staining and Western blot hybridization utilizing a Ni-NTA- 
alkalinc phosphatase conjugate (Qiag en ) which recognizes his-tagged proteins as described in 
Example 3I(c)(iii). This analysis permits the determination of expression levels of die 
pHisBotF protein (i.e.. number of mg/liter expressed as a soluble protein,. The purified non- 
protein will migrate as a single band of the predicted MW (i.e.. - 5()kl». 

The level of expression of the pHisBotF protein may be modified (increased) by 
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the laclq gene on 
the expression plasmid. and plasmid expressed in BL21(DE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (i.e.. less than about 0.5%) of 
soluble pHisBotF protein are expressed using the above expression systems, the plfisBotF 
construct may be co-expressed with pACYCGro construct as described in Example 32 In 
this case, the recombinant BoiF protein may co-purify with the folding chaperones. The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purified pHisBotF protein arc tested for endotoxin contamination using the LAL assay as 
described in Example 24. 

The purifed pHisBotF protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the Botl- protein using Gerbu GMDP adjuvani (CC Biotech) as 
described in Example 36. The ability of the anti-BoiF antibodies to neutralize native r. 
ha,ulin„ m type I- toxin is demonstrated using the mouse-C'. hntuiinum neutralization model 
described in Example 36. 

EXAMPLE 49 

Expression Of The C Fragment Of The C. hoi id mum 
Serotype G Toxin Gene And Generation Of Neutralizing Antibodies 

The < •. haiulinum type G neurotoxin gene has been cloned and sequenced (Campbell e, 
at. (1993) Biochimica et Biophysica Acta 1216:487 and Binz e, a/. ( 1990) Nucleic Acids Res, 
1 8:5556 1. The nucleotide sequence of the toxin gene derived from the 1 13/30 strain (NCFB 
3012) is available from the EMBL/GenBank sequence data banks under the accession number 
X74I62: the nucleotide sequence of the coding region is listed in SEQ II) NO:76. The amino 
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acid sequence of the C. boiulinum type G neurotoxin derived from this strain is listed in SEQ 
ID NO:77. 

The DIM A sequence encoding the native C. boiulinum serotype G C fragment gene 
derived from the 1 13/30 strain can be expressed using the pETHisb vector: the resulting 
coding region is listed in SEQ ID NO:78 and the corresponding amino acid sequence is listed 
in SEQ ID NO:79. The C fragment region from any strain of C: hoiulinum serotype G can 
be amplified and expressed using the approach illustrated below using the C fragment derived 
from ( hoiulinum type G 1 13/30 strain. Expression of the C fragment of C. hoiulinum type 
G toxin in heterologous hosts («..#.. E. coli) has riot been previously reported. 

The C fragment of the C. boiulinum serotype G (BotG) toxin gene is cloned using the 
prou»cols and conditions described in Example 28 for the isolation of the native BotA gene. 
The C. hotuimum type G 1 13/30 strain is obtained from the NCFB. The following, primer 
pair is used to amplify the BotG gene: 5 -CGCCAJGGCTGAC ACAATTTTAATACA 
AG 1-3' \5' primer, engineered Ncol site underlined (SEQ ID NO:80)J and 
y<iCCm^QrrATTCTGTCCATCCTTCATCCAC-y f3' primer, engineered Xlwl site 
underlined, native gene termination codon italicized (SEQ ID NO:8l)J. lollowine PCR 
amplification, the PCR product is inserted into the pCRscript vector and then the 1.5 kb 
fragment is cloned into pETHisb vector as described for BotA C fragment gene in Example 
28 with the exception that the sequences encoding BotCi are excised from the pCRscript 
vector by digestion with Ncol and A7i«l and the Ncol site is blunted (the BotG sequences 
contain an internal ///mllll site). This AV»l(fiIlcd)/A7mI fragment is then ligatcd to the 
pETHisb vector which has been digested with Nhel and Sail and the A7/t<l site is blunted. 
The resulting construct is termed plIisBotG. 

plIisBotG expresses the BotG gene sequences under the transcriptional control of the 
I 7. lac promoter and the resulting protein contains an N-terminal lOXHis-tag affinity tag. The 
pHisBotG expression construct is transformed into BL2KDE3) pLysS competent cells and I 
liter cultures are grown, induced and his-tagged proteins are purified utilizing a NiNTA resin 
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE 
and detected by Coomassie staining and Western blot hybridization utilizing a Ni-NTA- 
alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in 
Example 3l(c)(iii). This analysis permits the determination of expression levels of the 
pHisBotG protein (i.e.. number of mg/litcr expressed as a soluble protein). The purified BotG 
protein will migrate as a single band of the predicted MW (i.e.. -50kD). 
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The level of expression of the pHisBotG protein may be modified (increased) by 
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the lac.q gene on 
.the express.on plasmid. and plasmid expressed in BL2KDE3) cell lines in fermentation 
cultures as described in Example 30. If only very low levels (i.e.. less than about 0 5%) of 
-■> soluble pHisBotG protein are expressed using the above expression systems, the pHisBotG 
construct may be co-expressed with pACYCGro construct as described in Example 3^ |„ 
this case, the recombinant BotG protein may co-purify with the folding chaperones The 
contaminating chaperones may be removed as described in Example 34. Preparations of 
purilled pllisBotCi protein are tested for endotoxin contamination using the LAL assay as 
10 described in Example 24. 

The purifed pHisBotG protein is used to generate neutralizing antibodies. BALBc 
mice are immunized with the BotG protein using Gerbu GMDP adjuvant <CC Biotech, as 
described in Example 36. The ability of the anti-BotCi antibodies to neutralize native r 
W/m,,„ type- G toxin is demonstrated using the mousc-C. h,„uUn» m neutral iz.at.on model 
I > described in Example 36. 

EXAMPLE 50 

Expression Of Recombinant Botulinal Toxin Proteins In Eucaryotic Host Cells 
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Recombinant botulinal C fragment proteins may be expressed in eucaryotic host cells, 
such as yeast and insect cells. 

a) Expression In Yeast 

Botulinal C fragments derived from serotypes A. B. C. D. E. F and G may be 
expressed in yeast cells using a variety of commercially available vectors. I or example, the 
PPIC3K and P PIG9K expression vectors (Invitrogen) may be employed for expression in the 
methylotrophic yeast. Pichia panaris. When the P PIC3K vector is employed, expression of 
the botulinal C fragment protein will be intracellular. When the P I»1C3K vector is employed, 
the botulinal C fragment protein will be secreted (the alpha factor secretion s.unal is provided 
on the pPIC9K vector). 

DNA sequences encoding the desired C fragment is inserted into these vectors using 
techniques known to the art. Briefly, the desired botulinal expression cassette (including 
sequences encoding the his-tag: described in the preceding examples) is amplilled using the 



- 216 - 



WO 98/08540 



PCT/US97/15394 



PCR in conjunction with primers that incorporate unique restriction sites at the termini of the 
amplified fragment. Suitable restriction enzyme sites include SnaBl. EcpRl. Avr\\ and -Noil. 
When the botulinal C fragment is to be expressed using the pPIC3K vector, the initiator 
methionine (ATG) is provided by the desired Bot gene sequence and a Kozak consensus 
sequence is engineered upstream of the ATG (c ACGATGG). 

The amplified restriction fragment containing the botulinal C fragment gene is then 
cloned into the desired expression vector. Recombinant clones are integrated into the Pichia 
paxtaris genome and recombinant protein expression is induced using methanol following the 
manufacturer's instructions (Invitrogen Pichia expression kit manual). 

C. hoiulinum genes are A/T rich and contain multiple sequences that arc similar to 
yeast transcriptional termination signals (e.g.. Til I TATA). If premature transcription 
termination is observed when the botulinal C fragment genes are expressed in yeast, the 
transcription termination signals present in the C fragment genes can be removed by cither 
site directed mutagenesis (utilizing the pALTER system: Promega) or by construction of 
synthetic genes utilizing overlapping synthetic primers. 

The botulinal C fragment genes may be expressed in other yeast cells using other 
commercially available vectors fe.£\. using the pYES2 vector (Invitrogen) and S. cerevisiae 
cells (Invitrogen)]. 

b) Expression In Insect Cells 

Botulinal C fragments derived from serotypes A. B. C. D. E. F and G may be 
expressed in insect cells using a variety of commercially available vectors. For example, the 
pBlueBac4 transfer vector (Invitrogen) may be employed for expression in Spodoptera 
Jhigiperda (.V/9) insect cells (baculovirus expression system) (equivalent baculovirus vectors 
and host cells are avaialble from other vendors, e.g., Pharmingen. San Diego, CA). Botulinal 
C fragments contained on NcolltlindUl fragments contained within the pHtsBotA-G 
expression constructs (described in the preceding examples) are cloned into the pBlueBac4 
vector (digested with Nco\ and HindlU): the Ncol site present on the C fragment constructs 
overlaps with the start codon of the fusion proteins. In the case of botulinal C fragment 
clones that contain internal /Y//7dIIi sites using the BotG sequences described in Ex. 49). 

the C fragment gene is contained within a NcoMXhol fragment on the pHisBot construct. 
This NcoMXhol fragment is excised from pHisBot and inserted into pBIueBac4 digested with 
Ncol and Sail. Recombinant baculoviruses are made and the desired recombinant C fragment 
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is expressed in ecus using me provi<ted by , he ^ ny . _ 

« .1* ^ constructs „i„ ^ fc pHisBol prottin ~ 

*crc..„„ of botuUnal C fnlgn , cn , pro , cjns _ thc c fra6meni ^ 
constructs „ c , oncd inl „ , hc pMe|BacB ^ (lnv . _ <tanfcd 

pBiueBac, vector. When the pMelBacB vector is entpioycd. lhe kt ^ ei 

"7~ pro,cins ° re Sccre,cd """^ " —'.ion stgna,, 

and contain a nine amino acid extension at the N-terminus 

metal 1 " i ^"" !,!ed l ,,W,,li " , " C ' - X— i"«« eeiis are p u „„ ed using 

meu.1 ehelatton eolumm as described in the preceding examples. 



t r„m the above i, is dear that the present mvention provides composition and 
rnetheds for the preparation of effective n,ul,iva,e„, vaccines against C homUmm neurotoxin 

* am ^ M * """ ' I"-*. * used for the producon „ 

antttoxms. All pul)fa , io „ s and ^ fc ^ ^ 

■ncorporated hv reference. Various mediations »d variations of ,hc described method and 
system „. the ,„ve„U„„ wM be apparent to those skilled in the art without departmg fr„ m ,he 
scope and spirit oHIk* invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Williams, James A. ! 
Thalley, Bruce S. 

(U1 TIT ^ 1 °r. INVE jr lOM: Multivalent vaccine For Clostridium 
Botulinum Neurotoxin a^j-axum 

(iii) NUMBER OF SEQUENCES: 82 



(iv) CORRESPONDENCE ADDRESS: 
ic < A > ADDRESSEE : Medlen & Carroll 

\n\ %^ ET ' 220 Mont 9^ry Street, Suite 2200 

ICJ CITY: San Francisco 

<D) STATE: California 

(E) COUNTRY: United States of America 

(F) ZIP: 94104 
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(vj COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

•(D). SOFTWARE : Patentln Release #i.o. Version #i. 30 

(vi) CURRENT APPLICATION DATA : 

(AJ APPLICATION NUMBER: US 

(B) . FILING DATE: 

(C) CLASSIFICATION: 

(viii> ATTORNEY /AGENT INFORMATION: 

(A) NAME: Carroll, Peter G 

(B) REGISTRATION NUMBER: 32 837 

.(C) REFERENCE /DOCKET NUMBER ': OPHD- 02 95 9 

<ix) TELECOMMUNICATION INFORMATION - 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX : (415) 397-8338 

< 2 ) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS - 
u (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GGAAATTTAG CTGCAGCATC TGAC 
(2) INFORMATION FOR SEQ ID NO : 2 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TCTAGCAAAT TCGCTTGTGT TGAA 

(2) INFORMATION FOR SEQ ID NO : 3 : 
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U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

^ -.'-.(C) STRANDEDNESS: single 

' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

|0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTCGCATATA GCATTAGACC 

(2) INFORMATION FOR SEQ ID NO:4: 

\^ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
2 0 CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

CTATCTAGGC C TAAAGT AT 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 base pairs 
CB) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..8130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



40 



45 



ATG TCT TTA ATA TCT AAA GAA GAG TTA ATA AAA CTC GCA TAT AGC AT- 
Met Set Leu He Ser Lys Glu Glu Leu He Lys Leu Ala Tyl Ser iTe 

5 10 15 

AGA CCA AGA GAA AAT GAG TAT AAA ACT ATA CTA ACT AAT TTA PAP 
Arg Pro Arg Glu Asn Glu Tyr Lys Thr He Lea Thr A^n llu Asp §X 
ZU 25 30 

TAT AAT AAG TTA ACT ACA AAC AAT AAT GAA AAT AAA TAT TTG CAA tt. 
Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu A^n ^ Tyr llu GJLn Uu 
35 40 45 

AAA AAA CTA AAT GAA TCA ATT GAT GTT TTT ATG AAT AAA TAT AAA ACT 
Lys Lys Leu Asn Glu Ser He Asp Val Phe Met Asn Lys Tyr ^ Thr 

55 60 

60 ler ^ ^ A G ? A CTC TCT AAT CTA "A AAA GAT ATA TTA AAA 

Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp lie ™ Lys 

70 7 5 80 

S T t tT T f" ATT ^ ^ T TCC ACA CCT GTA GAA AAA AAT 

6 . Glu Val lie Leu He Lys Asn Ser Asn Thr Ser Pro Val Gl^ £n 

85 9 0 95 

TTA CAT TTT GTA TGG ATA GGT GGA GAA GTC AGT GAT ATT GCT ptt m 

Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp ill SI III Glu 
70 100 105 no 
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TAC ATA AAA CAA TGG GCT GAT ATT AAT GCA GAA TAT AAT ATT AAA CTG 384 
Tyr lie Lys Gin Trp Ala Asp He Asn Ala Glu Tyr Asn lie Lys Leu 
115 120 125 

5 TGG TAT GAT AGT GAA GCA TTC TTA GTA AAT ACA CTA AAA AAG GCT ATA 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 140 

10 ??T ^ TCT TCT ACC ACT GAA GCA ^ CAG CTA CTA GAG GAA GAG ATT 480 

I" Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu lie 

145 150 155 160 
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CAA AAT CCT CAA TTT GAT AAT ATG AAA TTT TAC AAA AAA AGG ATG GAA 
Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu 
1^5 170 175 

TTT ATA TAT GAT AGA CAA AAA AGG TTT ATA AAT TAT TAT AAA TCT CAA 
Phe lie Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
160 185 190 

ATC AAT AAA CCT ACA GTA CCT ACA ATA GAT GAT ATT ATA AAG TCT CAT 
He Asn Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser His 
195 200 205 

CTA GTA TCT GAA TAT AAT AGA GAT GAA ACT GTA TTA GAA TCA TAT AGA 
ueu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arg 
210 215 220 
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VI iw A ^ T TCT TTG AGA *** ATA AGT ^ AT GGG ATA GAT ATC AGG 720 

m> Thr Asn Ser Leu. Arg Lys He Asn Ser Asn His Gly lie Asp He Arq 

225 2 30 235 240 

GCT AAT AGT TTG TTT ACA GAA CAA GAG TTA TTA AAT ATT TAT AGT CAG 
Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Lfeu Asn He Tyr Ser Gin 
"° 245 250 255 

GAG TTG TTA AAT CGT GGA AAT TTA GCT GCA GCA TCT GAC ATA GTA AGA 816 
Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Arg 
260 265 270 

TTA TTA GCC CTA AAA AAT TTT GGC GGA GTA TAT TTA GAT GTT GAT ATG 864 
Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met 
275 280 285 

CTT CCA GGT ATT CAC TCT GAT TTA TTT AAA ACA ATA TCT AGA CCT AGC 91~> 
Leu Pro Gly He His Ser Asp Leu Phe Lvs Thr He Ser Arg Pro Ser 
290 295 300 

TCT ATT GGA CTA GAC CGT TGG GAA ATG ATA AAA TTA GAG GCT ATT ATG 96 0 

■Ml Ser He Gly Leu Asp Arg Trp Glu Met He Lys Leu Glu Ala He Met 

305 310 315 320 

AAG TAT AAA AAA TAT ATA AAT AAT TAT ACA TCA GAA AAC TTT GAT AAA 1008 
Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lvs 
^ 325 330 335 

CTT GAT CAA CAA TTA AAA GAT AAT TTT AAA CTC ATT ATA GAA AGT AAA 1056 
Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu He He Glu Ser Lvs 
340 345 350 

AGT GAA AAA TCT GAG ATA TTT TCT AAA TTA GAA AAT TTA AAT GTA TCT 
Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 
355 360 365 

GAT CTT GAA ATT AAA ATA GCT TTC GCT TTA GGC AGT GTT ATA AAT CAA 
Asp Leu Glu He Lys He Ala Phe Ala Leu Gly Ser Val He Asn Gin 
370 375 380 
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- "e s a s S: a ss 3 as i* a a is » 

395 400 

5 a: si? a •» £ si sk m - s - ~ s ^ - s 
.« s ss is s s ss ss s ss £ s e s si S xs 

425 430 



TCA TTA TTT AAT TCA GCT ACC GCA GAA AAC TCT ATG TTT tta m »«« 
Ser Leu Phe Asa Ser Ala Thr Ala Glu Asn Ser Met SI llu Thr ^ 



445 



30 Ut K SK E - s - « £ „, ^ E _ „. ^ _ 

505 510 

Gl£ tT» ^ c GT CTA TGG AGC TTT GAT CAA GCA ACT GCA AAA TAT 

35 Gin Glu lie Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala ™ 



525 



Gl^ 'SSI TAT GTA AGA GAT TAT ACT GGT GGA TCT CTT TT GAA 

Gin Phe Glu Ly S Tyr Val Arg Asp Tyr Thr Gly Gly Ser SI sfr g™ 

40 535 540 

=f is s ss is £i s ^ s ^ 



550 — ^ n 

56 0 



555 



ZI» ^ ^ f** A T T CCA TCA ^ AAT G ™ GAA GAA GCT GGA ACT 
Leu Leu Asn Asn Lys He Pro Ser Asn Asn Val Glu Glu Ala Gl Ser 

565 570 * 



57! 



» s: si si si si K - 12 s s: ss ss a? js s 

585 590 

55 si sj. s s sz s: 2 pi si & si s ^ is s s 

600 605 

ill lI A rt* f GA ^ ATG GAA AGT GCA AAA AGC TAG TTT TTA AGT 

He lie Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe IIu Sel 

60 615 620 

a? °" s? sk s m is sk us is £ si % ™ s sr 
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640 



AGA TTA AAA AAT AAG GAA AAA GTA AAA GTA ACC TTT ATT CCA ran- 
Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr III ill o°iy nil oTy 
645 650 655 



1200 



1248 



1296 



1344 



ATA GCA CCA TAC TTA CAA GTA GGT TTT ATG CCA raa r*r~r 

lie Ala Pro Tyr Leu Gin Val Oly pH ffi £J. Ala Arg J£ "' 2 

" 5 460 

K KSJ SS SS 21 SI S.K S SI SI « s »« 

?s s e sk sk s s £ sk ^ s n; - s - s « 
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AAA GAT GAA TTG AAC ACA AGC GAA TTT GCT AGA TTA AGT GTA GAT TCA 
Lys. Asp Glu Phe Asn Thr Set Glu Phe Ala Arg Leu Ser Val Asp ler 
660 665 670 

CTT TCC AAT GAG ATA AGT TCA TTT TTA GAT ACC ATA AAA TTA GAT ATA 
Leu ser Asn G lu He Ser Ser Phe Leu Asp Thr lie L^™ EJ He 

TCA CCT AAA AAT GTA GAA GTA AAC TTA CTT GGA TGT AAT ATG TTT AGT 
Ser Pro Lys Asn Val Glu Val Asn Leu Leu Gly Cys Asn «4t Pnl s« 

TAT GAT TTT AAT GTT GAA GAA ACT TAT CCT GGG AAG TTG CTA TTA AGT 
Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu ilu Ser 

71 5 720 

tTI mI? A V ACT TCC ACT TTA CCT GAT GTA AAT AAA AAT TCT 

He Met Asp Lys lie Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser 
725 730 735 

tTI ££ T tT A GCA ^ CAA TAT GAA GTA AGA. ATT AAT AGT GAG GGA 

He Thr lie Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu G^y 

AGA AAA GAA CTT CTG GCT CAC TCA GGT AAA TGG ATA AAT AAA GAA GAA 
Atg Lys Glu Leu Leu Ala His Ser Gly Lys Trp lie Asn Lys Glu Glu 

7 60 765 

GCT ATT ATG AGC GAT TTA TCT AGT AAA GAA TAC ATT TTT TTT GAT TCT 
Ala lie Met Ser Asp Leu Ser Ser Lys Glu Tyr lie Pl£ Phi AsJ ler 
" 77 5 7 80 

ATA GAT: AAT AAG CTA AAA GCA AAG TCC AAG AAT ATT CCA GGA TTA GCA 
lie Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly Leu Ala 

Ser lie If r r^, f** A ? A TTA ™ CTT GAT G ^A AGT GTT AGT 

Ser He Ser Glu Asp lie Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 aiO 81S 

CCT GAT ACA AAA TTT ATT TTA AAT AAT CTT AAG CTT AAT ATT GAA TCT 
Pro Asp Thr Lys Phe He Leu Asn Asn Leu Lys Leu Asn lie gVu" IeT 
fl 20 825 8 3o 

TCT ATT GGG GAT TAC ATT TAT TAT GAA AAA TTA GAG CCT GTT AAA AAT 
Ser lie Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Vai £n 
83S 840 84S 

tT A tT T S AC A * T TCT ATA GAT GAT TTA ATA GAT GAG TTC AAT CTA CTT 
He lie His Asn Ser He Asp Asp Leu lie Asp Glu Phe Asn Leu Leu 

8SS 860 

GAA AAT GTA TCT GAT GAA TTA TAT GAA TTA AAA AAA TTA AAT AAT CTA 
Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 

GAT GAG AAG TAT TTA ATA TCT TTT GAA GAT ATC TCA AAA AAT AAT TCA 
Asp Glu Lys Tyr Leu lie Ser Phe Glu Asp He Ser Lys £sn A^n Ser 
885 890 e95 

ACT TAC TCT GTA AGA TTT ATT AAC AAA AGT AAT GGT GAG TCA GTT TAT 
Thr Tyr Ser Val Arg Phe lie Asn Lys Ser Asn Gly Glu ler VaT Iy"r 
900 90S 910 

GTA GAA ACA GAA AAA GAA ATT TTT TCA AAA TAT AGC GAA CAT ATT ACA 
Val Glu Thr Glu Lys Glu lie Phe Ser Lys Tyr Ser Glu His ill Thr 

15 920 925 
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AAA GAA ATA AGT ACT ATA AAG AAT AGT ATA ATT ACA GAT GTT AAT GGT 

yS «o 116 SeC Thr 116 Lys Asn Ser Ile Ile Thr Asp Val A^n aty 

935 940 

AAT TTA TTG GAT AAT ATA CAG TO GAT CAT ACT TCT CAA GTT AAT ACA 
Asn Leu ieu Asp Asn lie Gin Leu Asp His Thr Ser Gin Val Asn Th£ 
945 950 955 960 

TI^ ^ C ^ A °? A T T C m ATT CAA TCA "A ATA GAT TAT AGT AGC AAT 
Leu Asn Ala Ala Phe Phe He Gin Ser Leu Ile Asp Tyr Ser Ser A^n 
965 970 975 

AAA GAT GTA CTG AAT GAT TTA AGT ACC TCA GTT AAG GTT CAA CTT TAT 
Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys val G^ 2J 

980 985 990 • 

? TA IZ T AGT ACA GGT TTA ACT ATA TAT GAC TCT ATC CAA 

Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tyr Asp Ser lie Gin* 
995 1000 1005 

'TIS vli i**- 7™ A T A TCA AAT GCA GTA AAT GAT ACT ATA AAT GTA CTA 
itlo ^ 5 Ala Val ^ .Asp Thr lie Asn Val Leu 

CCT ACA ATA ACA GAG GGG ATA CCT ATT GTA TCT ACT ATA TTA GAP rra 
Pro Thr He Thr Glu Gly lie Pro lie Val Ser Thr lie ™ Asp 

1030 1035 1Q 40 

?T£ T TA S? T G ? A GCA ATT TTA CTA GAC GAA CAT GAC CCA 

He Asn Leu Gly Ala Ala lie Lys Glu Leu Leu Asp Glu His Asp Pro 
1045 1050 1055 

TTA CTA AAA AAA GAA TTA GAA GCT AAG - GTG GGT GTT TTA GCA ATA AAT 
Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala lie Asn 
1060 1065 107 o 



J-J-e vdi 

1075 1Q80 1085 

GCT GAA GTT ACT ATT TTC TTA TTA CCT ATA GCT GGT ATA 1 
Ala Glu Val Thr Ile Phe Leu Leu Pro Ile Ala Gly Ile « 
1090 1095 iioo 

ATA CCT TCA TTA GTT AAT AAT GAA TTA ATA TTG CAT GAT t 
lie Pro Ser Leu Val Asn Asn Glu Leu Ile Leu His Asp I 
1105 1110 ins H 

TCA GTG GTA AAC TAT TTT AAT CAT TTG TCT GAA TCT AAA / 
Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lys t 
H25 1130 

CCT CTT AAA ACA GAA GAT GAT AAA ATT TTA GTT CCT ATT G 
Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro Ile A 
1140 j 

GTA ATA TCA GAA ATA GAT TTT AAT AAT AAT TCG ATA AAA C 
Val He Ser Glu Ile Asp Phe Asn Asn Asn Ser Ile Lys L 
H55 nee lies 

TGT AAT ATA TTA GCA ATG GAG GGG GGA TCA GGA CAC ACA G 
Cys Asn Ile Leu Ala Met Glu Gly Gly Ser Gly His Thr V 
1170 H75 1180 

AAT ATA GAT CAC TTT TTC TCA TCT CCA TCT ATA AGT TCT C 
Asn He Asp His Phe Phe Ser Ser Pro Ser Ile Ser Ser H 
H 85 1190 1195 
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TCA TTA TCA ATT TAT TCT GCA ATA GGT ATA GAA ACA GAA AAT CTA GAT 36 48 

Ser Leu Ser lie Tyr Ser Ala He Gly He Glu Thr Glu Asn Leu Asp 
1205 1210 1215 

TTT TCA AAA AAA ATA ATG ATG TTA CGT AAT GCT CCT TCA> AGA GTG TTT 36 96 

Phe Ser Lys Lys He Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe 
1220 1225 1230 

TGG TGG GAA ACT GGA GCA GTT CCA GGT TTA AGA TCA TTG GAA AAT GAC 
Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asn 
1235 1240 1245 

GGA ACT AGA TTA CTT GAT TCA ATA AGA GAT TTA TAC CCA GGT AAA TTT 
V T!?!L Ar9 Leu Leu Ser Ile Ar S Asp Leu Tvr Pro Gly Lys Phe 

1250 1255 1260 . ' . 

TAC TGG AGA TTC TAT GCT TTT TTC GAT TAT GCA ATA ACT ACA TTA AAA 
Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala He Thr Thr Leu Lys 
1265 1270 1275 !| 80 

CCA GTT TAT GAA GAC ACT AAT ATT AAA ATT AAA CTA GAT AAA GAT ACT 
Pro Val Tyr Glu Asp Thr Asn He Lys He Lys Leu Asp Lys Asp Thr 
1285 1290 --- 



1295 



AGA AAC TTC ATA ATG CCA ACT ATA ACT ACT AAC GAA ATT AGA AAC AAA 
Arg Asn Phe lie Met Pro Thr lie Thr Thr Asn Glu He Arg Asn Lvs 
1300 1305 i3i 0 

TTA TCT TAT TCA TTT GAT GGA GCA GGA GGA ACT TAC TH 1 TTA TTA TTA 
Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu 
1-31S 1320 1325 

TCT TCA TAT. CCA ATA TCA ACG AAT ATA AAT TTA TCT AAA GAT GAT TTA 

SCr ??tV ryr Pr ° Ile Ser Thr Asn Ile Asn Leu Ser Lys Asp Asp Leu 
1330 1335 134Q * . 

TGG ATA TTT AAT ATT GAT AAT GAA GTA AGA GAA ATA TCT ATA GAA AAT 
Trp He Phe Asn He Asp Asn Glu Val Arg Glu lie Ser He Glu Asn 
45 1350 1355 1360 

GGT ACT ATT AAA AAA GGA AAG TTA ATA AAA GAT GTT TTA AGT AAA ATT 
Gly Thr lie Lys Lys Gly Lys Leu lie Lys Asp Val Leu Ser Lys Ile 
1365 1370 1375 

GAT ATA AAT AAA AAT AAA CTT ATT ATA GGC AAT CAA ACA ATA GAT TTT 
Asp Ile Asn Lys Asn Lys Leu lie Ile Gly Asn Gin Thr Ile Asp Phe 
1380 1385 1390 

TCA GGC GAT ATA GAT AAT AAA GAT AGA TAT ATA TTC TTG ACT TGT GAG 
ser Gly Asp ile Asp Asn Lys Asp Arg Tyr He Phe Leu Thr Cys Glu 
1395 1400 1405 

TTA GAT GAT AAA ATT AGT TTA ATA ATA GAA ATA AAT CTT GTT GCA AAA 

^f? rt ASP LyS IlG Ser Leu Ile Ile Glu Ile Leu Val Ala Lys 

1410 14 15 1420 

TCT TAT AGT TTG TTA TTG TCT GGG GAT AAA AAT TAT TTG ATA TCC AAT 
Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu He Ser Asn 
1425 1430 143 5 1440 

TTA TCT AAT ACT ATT GAG AAA ATC AAT ACT TTA GGC CTA GAT AGT AAA 
Leu Ser Asn Thr He Glu Lys Ile Asn Thr Leu Gly Leu Asp Ser L-s 
1445 1450 1455 
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AAT ATA GCG TAC AAT TAC ACT GAT GAA TCT AAT AAT AAA TAT TTT m 
. Asn He Ala Tyr Asn Tyr Thr Asp Glu Ser £1 ™ Tyr SJ 

1 * b ° 1465 1470 

GCT ATA TCT AAA ACA ACT CAA AAA AGC ATA ATA CAT TAT AAA AAA GAC 
Ala He er Lys Thr Ser Gin Lys^Ser lie He His Tyr ^ ™ 
H 1480 1485 ' 

AGT AAA AAT ATA TTA GAA TTT TAT AAT GAC AGT ACA TTA GAA TTT AAC 

L6U GlU J Yr AS " As * Ser T1 ~ ^ Glu JS £n 

1495 1500 

f** T" ATT GCT GAA GAT ATA GTA TTT ATG AAA GAT GAT 

S« s Ly Asp Phe lie Ala Glu Asp He Asn Val Phe Met 

1515 1520 
ATT AAT ACT ATA ACA GGA AAA TAC TAT GTT GAT AAT AAT ACT CAT nll 
He Asn Thr He Thr Gly Lys Tyr Tyr Val Asp j£J A^n Thr £J 

1525 1530 15 ^ 5 Y " 

Ser tT - A AT UH C I" ATT TCT TTA GTT AGT AAA AAT CAA GTA AAA GTA 
Ser lie Asp he Ser He Ser Leu Val Ser Lys Asn Gin Val Lyt v™ 
1540 1545 1550 X 

AAT GGA TTA TAT TTA AAT GAA TCC GTA TAC TCA TCT TAC CTT GAT T-T 
Asn Gly Le^Tyr Leu Asn Glu Serial Tyr Ser Ser Tyr Si Asp Tkl 

GTG AAA AAT TCA GAT GGA CAC CAT AAT ACT TCT AAT TTT ATG AAT TTA 

ts70 " ASP HlS HiS ASn Thr Ser Asn Phe Uu 

1575 1580 

p£I 7 TG f f AC AAT A T A AGT TTC TGG AAA TTG TTT GGG TTT GAA AAT ATA 
Phe Leu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu j£n- He 

1590 1595 1600 

AAT TTT GTA ATC GAT AAA TAC TTT ACC CTT GTT GGT AAA ACT AAT CTT 
Asn Phe Val He Asp^Lys Tyr Phe Thr Leu Val Gly L^ Thr Leu 
1605 1610 1615 

GGA TAT GTA GAA TTT ATT TGT GAC AAT AAT AAA AAT ATA GAT ATA TAT 
Gly Tyr Val Glu Phe He Cvs Asp Asn Asn Lys Asn He AsJ He Tyr 
1620 1625 16 ^ Q Y 

TTT GGT GAA TGG AAA ACA TCG TCA TCT AAA AGC ACT ATA TTT ACr rra 
Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser £hr He £1 Ser Gly 
1635 1640 1645 Y 

AAT GGT AGA AAT GTT GTA GTA GAG CCT ATA TAT AAT CCT GAT ACG GGT 

?&n Ar9 ^ GlU PL "° IlG T ^ r Asn P ™ As P "hr Gil 

1650 1655 1660 X 

GAA GAT ATA TCT ACT TCA CTA GAT TTT TCC TAT GAA CCT CTC TAT GGA 
Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tyr Gly 
1665 1670 1675 y 16 £ 0 

ATA GAT AGA TAT ATA AAT AAA GTA TTG ATA GCA CCT GAT TTA TAT ACA 
He Asp Arg Tyr He Asn Lys Val Leu He Ala Pro Asp Leu Tvr Thr 
1685 1690 l - sss 

AGT TTA ATA AAT ATT AAT ACC AAT TAT TAT TCA AAT GAG TAC TAC CCT 
Ser Leu He Asn He Asn Thr Asn Tyr Tyr Ser Asn Glu Tyr Tyr Pro 
1700 1705 l7i0 

GAG ATT ATA GTT CTT AAC CCA AAT ACA TTC CAC AAA AAA GTA AAT ATA 
Glu He lie Val Leu Asn Pro Asn Thr Phe His Lys Lys Vat A^n He 
1715 1720 1725 
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AAT TTA GAT AGT TCT TCT TTT GAG ..TAT AAA TGG TCT ACA GAA GGA AGT 

^? rt ASp Ser Ser Ser Phe Glu L V S Tr P Ser Thr Glu Gly Ser 

1730 1735 1740 

GAC TTT ATT TTA GTT AGA TAC TTA GAA GAA AGT AAT W AAA ATA TTA 
Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lvs Lys He Leu 
5 1750 1755 * 1760 



AAA ATG AGT ATA GAT TTT AAA GAT ATT AAA AAA CTA TCA TTA GGA TAT 
Lys Met Ser He Asp Phe Lys Asp lie Lys Lys Leu Ser Leu Glv Tyr 
1780 1785 J 1790 " 

t7 A * GT TTT TCA TTT ^ TCT Gft A AAT GAA TTA GAT AGA 

He Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp ArJ 
1795 1800 i 8 05 

GAT CAT TTA GGA TTT AAA ATA ATA GAT AAT AAA ACT TAT TAC TAT GAT 

i8fo LeU Gly Phe Lys fe!s Ile ASP Asn Lys T qI Tyr Tyr Ty *' ^ 

GAA GAT AGT AAA TTA GTT AAA GGA TTA ATC AAT ATA AAT AAT TCA TTA 
Glu Asp Ser Lys Leu Val Lys Gly Leu He Asn lie Asn Asn Ser Leu 
1825 1830 1635 . 184 0 



ACT AGT TAT AAA ATT ATT AAT GGT AAA CAC TTT TAT TTT AAT AAT GAT 
Thr Ser Tyr Lys lie He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp 
1875 1880 i8 8 5 

GGT GTG ATG CAG TTG GGA GTA TTT AAA GGA CCT GAT GGA TTT GAA TAT 

X YoL MGt Gln LeU Gly Val Phe L * s Gl Y Pro As P Gly Phe Glu Tyr 
1890 1895 1900 

TTT CCA CCT GCC AAT ACT CAA AAT AAT AAC ATA GAA GGT CAG GCT ATA 

Pr ° Ala ASn Thr Gln Asn Asn Asn Il€ Glu Gly Gin Ala He 
1905 1910 i9i5 192Q 



GAT AAT AAC TCA AAA GCA GTC ACT GGA TGG AGA ATT ATT AAC AAT GAG 
Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg He He Asn Asn Glu 
1940 1945 19SQ 

AAA TAT TAC TTT AAT CCT AAT AAT GCT ATT GCT GCA GTC GGA TTG CAA 
Lys Tyr Tyr Phe Asn Pro Asn Asn Ala lie Ala Ala Val Gly Leu Gln 
1955 1960 1965 

GTA ATT GAC AAT AAT AAG TAT TAT TTC AAT CCT GAC ACT GCT ATC ATC 
Val He Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala II* rie 
1970 1975 198Q 

TCA AAA GGT TGG CAG ACT GTT AAT GGT AGT AGA TAC TAC TTT GAT ACT 
?oL LyS Gly Trp Gin Thr VaI Asn Gl Y Ser Arg Tyr Tyr Phe Asp Thr 
1985 1990 1"5 2000 



5232 



5280 



10 ^ ^ A T A AGA A T C AAA GGT ATC TTA TCT AAT ACT CAA TCA TTT AAT 532 8 

IU Gln Lys He Arg He Lys Gly He Leu Ser Asn Thr Gln Ser Phe Asn 

1765 1770 1775 



5376 



5424 



54 72 



5520 



10 II I TTT GAT CCT ATA GAA TTT AAC TTA GTA ACT GGA TGG CAA ACT 556 8 

M) Phe Tyr Phe Asp Pro He. Glu Phe Asn Leu Val Thr Gly Trp Gln Thr 

1845 i e5 o ie55 

ATC AAT GGT. AAA AAA TAT TAT TTT GAT ATA AAT ACT GGA GCA GCT TTA 
He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
JJ 1860 1865 i 8 70 



5616 



5664 



5711 



5760 



SO w T ^ A Sf* AGT ^ ^ TTA AGT TTG AAT GGC AAA AAA TAT TAT TTT 5808 

Val ^/r Gln Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr T-r Phi 
1925 1930 X935 



5856 



5904 



5952 



6000 
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OAT ACC OCT ATT GCC TTT AAT GGT TAT AAA ACT ATT GAT GCT 

Asp Thr Ala He *j. -Ph." Asn Gly Tyr ^ *H Asp S?J J£. Sis 

2010 2015 

' SSI ^vr pII e GT GAT TGT GTA GTG ftft A ATA GGT GTG TTT ACT ACC 

Phe Tyx Phe Asp Ser Asp Cys Val Val Lys He Gly Val hiZ Ser Thr 

zu^u 2025 



2030 



6096 



TCT AAT GGA TTT GAA TAT TTT GCA CCT GCT AAT ACT TAT AAT AAT ^ 
Ser Asn G y Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr £n £n £n 

2040 2045 

rlt CAG GCT ATA GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT 

lie CX^Gly Ola Ala He Va^Tyr Gin Ser Lys p^ZE ThT 

Gl3 tC* *** T AT ^ AC TTT GAT ^ TCA AAA GCA GTT ACC GGA TTG 

Gly Lys Lys Tyr Tyr Phe Asp Asa Asn Ser Lys Ala Val Thr G^y lIu 

20 ° 2075 2080 

C^ ThI ill c GT ?** TAT TAC ^ ACT AAC ACT GCT GAA 

Cln Thr He Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 

2085 2090 2095 

GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAP rrr a 
" ia THr Gi » Asp Gly ^ -Tyl *™ £n 

" 2i05 2110 

A* 0 ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA 
Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Tnr lH Asp G^y Lys 
'US 2120 2125 y >- 

AAA TAT TAC TTT AAT ACT AAC ACT GCT ATA GC^ TCA ACT GCT tit a™ 
Lys T yr Tyr, Phe Asn Thr Asn Thr Ala lie Ala ler £hr oTy Tyl Thr 
" 13 ° 2135 2140 

ATT ATT AAT GGT AAA CAT TTT TAT TTT AAT ACT GAT GGT ATT ATG CAP 
lie He Asn Gly Lys „i he Tyr pne ^ ^GT ATT ATG CAG 

2150 2155 2160 

ATA GGA GTG TTT AAA GGA CCT AAT GGA TTT GAA TAT TTT GCA CCT GCT 
-le Gly Val Pne Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro A^I 
^ 165 2170 2175 

fAT "ACQ GAT GCT AAC AAC ATA GAA GGT CAA GCT ATA CTT TAC PAA AAT 
Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He III £yr Gin 
2180 2185 -190 

ll C '7™ tu T TTG GGT *** TAT TAC TTT GGT AGT GAC TCA 

Clu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tvr Phe Gly S^r Asc Ser 
-195 2200 2205 

?? A w T T ^ GGA TGG AGA ATT ATT MC AAT AAG AAA TAT TAC TTT 
Lys Ala Val Thr Gly Trp Arg He lie Asn Asn Lys Lys Tyr Tyr Pnl 
*^ 1U 2215 2220 

AAT CCT AAT AAT GCT ATT GCT GCA ATT CAT CTA TGC ACT ATA- AAT AAT 
Asn Pro Asn Asn Ala lie Ala Ala He His Leu Cys Thr lie 
" -230 2235 224Q 

GAC AAG TAT TAC TTT AGT TAT GAT GGA ATT CTT CAA AAT GGA TAT ATT 
Asp Lys Tyr- Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly ™r iTl 
2245 2250 



6192 



6240 



6286 



6336 



6384 



643: 



6480 



6528 



6576 



6624 



6672 



6 720 



6768 



Thr til ^ n ^ T ^ T TAT TTT GAT GCT AAT AAT GAA TCT AAA 

Thr He Glu Arg Asn Asn Phe- Tyr Phe Asp Ala Asn Asn Glu Ser Lys 
2260 2265 2270 



6816 
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ATG GTA ACA GGA GTA TTT AAA GGA CCT AAT GGA TTT GAG TAT TTT GCA 

Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 
2275 2280 2 285 

CCT GCT AAT ACT CAC AAT AAT AAC ATA GAA GGT CAG GCT^ ATA GTT TAC 

Pro Ala Asn Thr His Asn Asn Asn He Glu Gly Gin Ala He Val Tvr 

22 ?° 2295 2300 



15 



20 



40 



60 



65 



** 3 ° J 2310 2315 2320 

GAC TCA AAA GCA GTT ACT GGA TGG CAA ACC ATT GAT GGT AAA AAA TAT 
Asp Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tvr 
2325 2330 2335 

TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT 
Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He 
2340 234S 2350 

GAT GGT AAA AAA TAT TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT 
Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2365 

GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT 
Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
2370 2375 2380 



6864 



6912 



III C ? G ■ AAA " TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT GAT AAT 6 960 

IW Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 



7008 



7056 



7104 



7152 



TTC ATA GCC TCA ACT GGT TAT ACA AGT ATT AAT GGT AAA CAT TTT TAT 7200 
•>U Phe lie Ala Ser Thr Gly Tyr Thr Ser He Asn Gly Lys His Phe Tvr 

2385 2390 2395 2400 

TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT AAT 724 8 

Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lvs Gly Pro Asn 
-° 2405 2410 * 2415 

GGA TTT GAA TAC TTT GCA CCT GCT AAT ACG GAT GCT AAC AAC ATA GAA 72 96 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu 
2420 2425 2430 

GGT CAA GCT ATA CTT TAC CAA AAT AAA TTC TTA ACT TTG AAT GGT AAA 7344 
Gly Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lvs 
2435 2440 2445 

45 AAA TAT TAC TTT GGT AGT GAC TCA AAA GCA GTT ACC GGA CTG CGA ACT 7 3 92 

Lys Tyr Tyr Phe Gly Ser Asp. Ser Lys Ala Val Thr Gly Leu Arq Thr 
2450 2455 2460 

ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GTT GCA GTT 74 4 0 

J» He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 

2465 2470 2475 2480 

ACT GGA TGG CAA ACT ATT AAT GGT AAA AAA TAC TAC TTT AAT ACT AAC 74 88 

Thr Gly Trp Gin Thr He Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
" 2465 2490 2 495 

ACT TCT ATA GCT TCA ACT GGT TAT ACA ATT ATT AGT GGT AAA CAT TTT 7S36 
Thr Ser He Ala Ser Thr Gly Tyr Thr He He Ser Gly Lvs His Ph<> 
2500 2505 2510 

TAT TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT 7 58 4 
Tyr Phe Asn Thr Asp Gly He Met Gin He Gly Val Phe Lys Gly Pro 
2515 2520 2525 

GAT GGA TTT GAA TAC TTT GCA CCT GCT AAT ACA GAT GCT AAC AAT ATA 76"i2 
Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He 
2530 2535 2540 
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50 



60 



65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



20 

A'S Leu 
35 



50 



65 



70 " 70 75 



Glu 


Glu 


Leu 


He 
10 


Lys 


Leu 


Ala 


Tyr 


Sei- 
15 


Tyr 


Lys 


Thr 

25 


He 


Leu 


Thr 


Asn 


Lev* 
3 0 


Asp 


Asn 


Asn 
40 


Asn 


Glu 


Asn 


Lys 


Tyr 

-35 


Leu 


Gin 


He 
55 


Asp 


Val 


Phe 


Met 


Asn 
60 


Lys 


Tyr 


Lys 


Leu 


Ser 


Asn 


Leu 


Lys 


Lys 


Asp 


He 


Leu 



. |^ §K SI E - » ™. * « . , M0 

■ 5 is s . s s srs? £ s s S! S 

2585 2590 

2600 2605 

GGT TTA CCT CAG ATA GGA GTG TTT AAA GGG Tr-r a at rv-n -r~™, ~ 
OXy L.U Pro G X„ XXe. oiy V.J p2 Sy S ^ £J J™ «J ™ 

>0 2615 2620 ' 

S, S - - - S5 E 22E SI "i » - s ffi 



7872 



7 920 



7968 



S5 JS ^ £^ S - s f£ £ 35 ST- K - 
SS ?S E ffi £1 J5 K S -S 5Wk 

^ 6B0 2685 



8064 



8112 



J£ ssk a;, sir « s js Si ;n 2; ss k 

40 2695 2700 

GCC CCT GGG-A-TA TAT GGC TAA 

Ala Pro Gly He Tyr Gly 8133 

(2) INFORMATION FOR SEQ ID NO : 6 : 
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10 



15 



30 



40 



45 



(>0 



65 



Glu Val He Leu He Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn 
65 90 95 

Leu His Phe Val Trp He Gly Gly Glu Val Ser Asp He Ala Leu Glu 
100 105 no 

Tyr He Lys Gin Trp Ala Asp lie Asn Ala Glu Tyr Asn' He Lys Leu 
115 120 125 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 140 

Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu He 
145 150 155 160 

Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tvr Lvs Lys Arg Met Glu 
165 170 ' 175 

Phe lie Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
- u 180 185 



190 



He Ash Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser Hi- 
195 200 



205 



Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Ara 
210 * 215 220 

Thr Asn Ser Leu Arg Lys He Asn Ser Asn His Gly He Asp lie Arq 
225 230 235 240 

Ala Asn Ser Leu Phe thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
245 250 255 

lu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Ara 

-O 260 265 — 



70 



Leu Leu Ala Leu Lys Asn Phe Gly Gly Val tyr Leu Asp Val Asd Met 
27 5 280 285 

Leu Pro Gly He His Ser Asp Leu Phe Lys Thr He Ser Arq Pro Ser 
290 295 300 

Ser lie Gly Leu Asp Arg Trp Glu Met He Lvs Leu Glu Ala He Met 
305 310 3i5 320 

Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lvs 
325 330 335 ' 

Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu He He Glu Ser Lvs 
M ' 340 345 350 

Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 
355 360 365 

:0 Asp Leu Glu He Lys He Ala Phe Ala Leu Gly Ser Val He Asn Gin 

370 375 380 

Ala Leu He Ser Lys Gin Gly Ser Tyr Leu Thr Asn Leu Val He Glu 
385 390 395 4 00 

Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 
405 410 



415 



He Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys He Phe His Asp 

420 425 430 

Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 

435 440 445 
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IXe Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg Ser Thr 

455 460 
He Ser Leu Ser Gly Pro Gly Ala Tvr at* *y 

465 47 o Y Aa lyr Ala Ser Ala Tyr Tyr Asp Phe 

475 1 480 

lie Asn Leu Gin Glu Asn Thr lie Glu Lys Thr Leu Lys Ala Ser Asp 

490 495 
Leu lie Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 

505 510 
Gin Glu lie Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lys Tyr 

520 525 

Oln Phe Glu Lys Tyr Val Arg Asp Tvr Thr Gly Gly Ser Leu Ser Glu 

535 540 

Asp Asn Gly val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 



555 560 



Leu Leu Asn Asn Lys lie Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 



570 



575 



LyS ASn Tyi ' Si HiS Tyr Ile Ile «-» cm Gly Asp Asp „. Ser 

585 590 



Tyr Glu Ala Thr Cvs Asn Leu Phe Ser Lys A Sn Pro Lys Asn Ser He 

600 605 
lie lie Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser 

615 620 

5 2 5 G1U Ser Hi L6U Glu Leu A ="> Lys- Tyr Arg He Pro Glu 

635 640 
Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe lie Gly His Gly 
645 650 655 

Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser 

665 670 

Leu ser Asn Glu He Ser Ser Phe Leu Asp Thr He Lys Leu Asa He 

680 685 

S,r Pro Lys Asn Val Glu Val Asn Leu Leu Gly cys Asn Mot Phe Ser 



7 00 



Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lyo Leu Leu Leu Ser 

71 5 720 



730 ' V 3 's ^ 



He Met Asp Lys He Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Se 

He Thr lie Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly 

Arg Lys Glu Leu Leu Ala Hxs Ser Gly Lys Trp He Asn Lvs Glu Glu 

760 765 

Ala He „b t Sor Asp Leu ser Lys Giu Tyr ne 

775 780 
U« Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly Leu Ala 

795 800 
Ser He Ser Glu Asp He Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 810 815 
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- ' . ■ . . f 

Pro Asp Thr Lys Phe lie Leu Asn Asn Leu Lys Leu Asn lie Glu Ser 
820 825 830 

Ser He Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro, Val Lys Asn 
835 840 845' 

lie lie His Asn Ser lie Asp Asp Leu lie Asp Glu Phe Asn Leu Leu 
850 855 860 

Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 
865 870 875 880 

Asp Glu Lys Tyr Leu He Ser Phe Glu Asp lie Ser Lvs Asn Asn Ser 
885 890 * 895 

Thr Tyr Ser Val Arg Phe He Asn Lys Ser Asn Gly Glu Ser Val Tyr 
900 90S 910 

Val Glu Thr Glu Lys Glu He Phe Ser Lys Tyr Ser Glu His He Thr 
915 920 925 

Lys Glu He Ser Thr He Lys Asn Ser He He Thr Asp Val Asn Gly 
930 935 940 

Asn Leu Leu Asp Asn lie Gin Leu Asp His Thr Ser Gin Val Asn Thr 
945 950 955 960 

Leu Asn Ala Ala Phe Phe He Gin Ser Leu He Asp Tyr Ser Ser Asn 
965 970 975 

Lys Asp vai Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gin Leu Tyr 
980 985 990 

Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tyr Asp Ser He Gin 
995 1000 1005 

Leu Val Asn Leu lie Ser Asn Ala Val Asn Asp Thr He Asn Val Leu 
1010 1015 1020 

Pro Thr lie Thr Glu Gly He Pro He Val Ser Thr He Leu Asp Gly 
1025 1030 1035 1040 

He Asn Leu Gly Ala Ala He Lys Glu Leu Leu Asp Glu His Asp Pro 
1045 1050 1055 

Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala He Asn 
1060 1065 1070 

Met Ser Leu Ser lie Ala Ala Thr Val Ala Ser lie Val Gly He Gly 
1075 1080 1085 

Ala Glu Vai Thr He Phe Leu Leu Pro He Ala Gly He Ser Ala Gly 
1090 1095 lioo 

He Pro Ser Leu Val Asn Asn Glu Leu He Leu His Asp Lys Ala Thr 
1105 1110 ins 1120 

Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lvs Lys Tyr Gly 
H25 H30 ' 1135 

Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro He Asp Asp Leu 
1140 1145 H50 

Val He Ser Glu He Asp Phe Asn Asn Asn Ser He Lvs Leu Gly Thr 
1155 1160 ii65 

Cys Asn lie Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Gly 
H70 H75 1180 
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Ss 116 ASP HiS Phe ^| o Ser Ser Pro Ser He Ser Ser His Ile Pro 

1195 120 
Ser. Leu Ser lle Ty^Ser Ala Xle Gly Xle Glu Thr Glu Asn Leu ^ 



1210 1215 



Ph e Ser L ys Ly^Xle „e t Met Leu ^ Asn Ala prQ Sgr ^ 
Trp Trp Gl^Thr Gly Ala val p r . Gly Leu Arg ser ffig «r^n Asp 

Gly ThrAr* Leu Leu Asp Ser lie Arg Asp Leu Tyr Pro Glv Ly a p he 

1260 

IVes^ ^ ^ Tyr ^70^ Phe ^ Ala lie Thr Thr Leu Lys 

1275 1280 
Pro val Tyr OXu Asp^hr Asn xle Lys xi^Lys ^ Asp Lys 

M ' 9 ASn ^ llto" et »a.Th o% Thr Asn Glu Xle Ar^Ly. 

UU Il[s Ser ASP ^ A ^ Q G1 V «y Thr Tyr JerLTueu Leu 

Ser s-rryr Pro rle Ser sn ^ ^ ^ ^ ^ 

" LJJ J 1340 

^ ^ 5 Giy »• ^Asp Val Leu ser Lys^T 

ASP lie Asn Ly^Asn Lys Leu Ile lie oly Asn Gin Thr xle As^Phe 

1385 13go 

S " G1> ' A ! P 5 Ile ASP AS " LyS ^ Ar " *Y* lie Phe Leu Thr Cys oiu 

HOU 1405 

Leu Asp o A S p Lys Xle Ser Leu lie Xle Glu lie Asn Leu Val Ala Lys 

1420 

?«5 T " /l ' L6U ^3o Ser GlY ASP LyS A ^ 5 T >- r Leu "e Ser Asn 

LSU ASn Ut° l » Lys lle *.n Thr Leu Gly Leu Asp Ser Vys 



1450 14SS 



Asn Ile AXa T^Asn T yr Thr Asp Clua.rta Asn L ys Tyr Phe Gly 

1465 1470 

Al. Xle s.r L y. Thr Ser Gin Lys Ser Xle Xle His Tyr Lys Lys Asp 

1480 1485 

^io ASn LSU G1U fSs^.*"" AS P S« Thr Leu Glu Phe Asn 

1495 1500 

SerLy. Asp Phe lie Ala^lu Asp Xle Asn Val^he Met Lys Asp Asp 
Xle Asn Thr Xle TJrOly Lys Tyr Tyr Val^Asp Asn A,n Thr 

ser Xle Asp Phe ser Xl e Se r Leu yal Ser Lys Asn Gin Val Lys^al 

1545 1550 
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Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1560 1565 

Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 
1570 1575 isao 

Phe Leu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn He 
1585 1590 1595 160 0 

Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Glv Lys Thr Asn Leu 
1605 1610 * 1615 

Gly Tyr Val Glu Phe He Cys Asp Asn Asn Lys Asn He Asp lie Tyr 
1620 1625 i 6 3o 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Glv 
1635 1640 1645 

Asn Gly Arg Asn Val Val Val Glu Pro He Tyr Asn Pro Asp Thr Glv 
1650 1655 1660 Y 

Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tvr Glv 
1665 1670 1675 * 16 qo 

lie Asp Arg Tyr He Asn Lys Val Leu lie Ala Pro Asp Leu Tyr Thr 
16 8 5 16 90 16 95 

Ser Leu He Asn He Asn Thr Asn Tyr Tyr Ser Asn Glu Tvr Tyr P-o 
1700 1705 1710 

Glu lie He Val Leu Acn Pro Asn Thr Phe His Lys Lys Val Asn He 
1715 1720 - 1725 

Asn Leu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Gly Ser' 
1730 1735 1740 

Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys He Leu 
1745 1 7 50 1755 " 176 o 

Gin Lys He Arg lie Lys Gly He Leu Ser Asn Thr Gin Ser Phe Asn 
1765 1770 1775 

Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser Leu Gly Tvr 
1780 1765 1790 " 

lie Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arq 
1795 1800 1605 

Asp His Leu Gly Phe Lys He He Asp Asn Lys Thr Tvr Tyr Tyr Asp 
1810 1815 1820 

Glu Asp Ser Lys Leu Val Lys Gly Leu He Asn He Asn Asn Ser Leu 
182S 1830 1835 1840 

Phe Tyr Phe Asp Pro He Glu Phe Asn Leu Val Thr Gly Trp Gin Thr 
1845 1650 1855 

He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
I860 1865 1870 

Thr Ser Tyr Lys He He Asn Gly Lys His Phe Tyr Phe Asn Asn Asp 
!875 1880 1885 

Gly Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr 
1890 1895 1900 

Phe Ala Pro Ala Asn Thr Gin Asn Asn Asn He Glu Gly Gin Ala He' 
1905 1910 1915 19 20 
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val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe 

A: '^ 0 1930 



193 5 



Asp Asn Asn ser Ly S Ala Val Thr Gly Trp Arg lie n e Asn Asn Glu 

1945 J950 
Lys Tyr T^Phe As „ Pro Asn Asn^Ala He Ala Ala Val Gly Leu Gln 



1965 



19?0 ASP ASn LyS lr* YT Phe ASn Pr ° Afi P Thr A1 * "e lie 

1975 1980 

B65 LyS Gln I^ 0 Val Asn ^ Ser Arg Tyr Tyr Phe Asp Thr 

1995 . 2000 

Asp Thr Ala lie Ala Phe Asn Gly Tyr Lys Thr He Asp Gly Lys His 

2010 2015 
Phe Tyr Phe Asp Ser Asp Cys Val Val Lys lie Gly Val Phe Ser Thr 

2030 

Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 

^° 40 2045 

lie Glu Gly Gln Ala He Val Tyr Gln Ser Lys Phe LeU Thr Leu Asn 

2060 

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 

- 070 2075 ?oeo 

Gln Thr He A sp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 

£vob *"» n a n 

^° 90 2095 

Ala Ala Thr Gly Trp Gln Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn 

~ 2105 7.110 

Thr ' ASn ™[ Ala Glu Ala Ala Jhr Gly Trp Gln Thr He Asp Gi/lvs 
** ^120 2125 

IjVS ^lo^ ^ A f" THr Ala Ile Ala S " Thr Gly Tyr Thr 

2135 2140 

lie lie Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Me: Gln 

- 150 . 2155 2160 

He Gly val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro AJ a 

2170 2175 

Asn Thr Asp Ala Asn Asn He Glu Gly Gln Ala lie Leu Tvr Gln Asn 
" 18 ° 2185 2 i 90 

Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser 
^ Aab 2200 2205 

Lys Ala val Thr Gly Trp Arg lie He Asn Asn Lys Lys Tyr Tyr Phe 

-215 2220 

Asn Pro Asn Asn Ala lie Ala Ala He His Leu Cys Thr He Asn Asn 

2230 ooic; 



2240 



Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gln Asn Gly Tvr He 

2250 2255 

Thr He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu s-r Lys 

^ b b 2270 

Mec Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 

2280 2285 
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Pro Ala Asn Thr His Asn Asn Asn He Glu Gly Gin Ala lie Val Tyr 
2290 2295 2300 

Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tvr Tyr Phe Asp Asn 
2305 2310 2315 ' 2320 

Asp Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr 
2325 2330 2335 

Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He 
2340 2345 2350 

Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2365 

Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
2370 2375 2380 

Phe lie Ala Ser Thr Gly Tyr Thr Ser He Asn Gly Lvs His Phe Tvr 
2385 2390 2395 ' 2400 

Phe Asn Thr Asp Gly He Met Gin lie Gly Val Phe Lys Gly Pro Asn 
2405 2410 2415 

Gly Pne Glu Tyr Phe Ala Pro Ala Ash Thr Asp Ala Asn Asn He Glu 
2420 2425 2430 

Gly Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lvs 
2435 2440 2445 

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr 
245Q 2455 2460 

He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 
2465 2470 2475 2480 

Thr Gly Trp Gin Thr He Asn Gly Lys Lys Tvr Tyr Phe Asn Thr Asn 
2485 2490 2495 

Thr Ser He Ala Ser Thr Gly Tyr Thr He He Ser Gly Lys His Phe 
2500 2505 2510 

Tyr Phe Asn Thr Asp Gly He Met Gin lie Gly Val Phe Lys Gly Pro 
2515 2520 2525 

Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He 
2530 2535 2540 

Glu Gly Gin Ala He Arg Tyr Gin Asn Arg Phe Leu Tyr Leu His Asp 
2545 2550 2555 2560 

Asn He Tyr Tyr Phe Gly Asn. Asn Ser Lys Ala Ala Thr Gly Trp Val 
2565 2570 2575 

Thr He Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly 
2580 2585 2590 

Ala Asn Gly Tyr Lys Thr He Asp Asn Lys Asn Phe Tvr Phe Arg Asn 
2595 2600 2605 

Gly Leu Pro Gin He Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr 
2610 2615 2620 

Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He 
2625 2630 2635 2640 

Arg Tyr Gin Asn Arg Phe Leu His Leu Leu Gly Lys He Tvr Tyr Phe 
2645 2650 ' 2655 



WO98/0854 ° ! PCT/U S! )7A5394 

^ ^ fllo^ ^ Val Thr °^ S T ^ Thr xie Asn ci y Lys 

^ b65 2670 

^ Pr ° AS * I 6 V la M « Ala Ala Gly Giy Leu 



Phe Gillie Asp Gl y val Il^Tyr Phe Phe Gly Val Asp Gly Val 

2700 

Ala Pro Gly H e Tyr Gly ( 
2705 2710 

INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH; 811 amino acids 

(B) TYPE: amino acid 

JO STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<*i> SEQUENCE DESCRIPTION: GEQ ID NO: 7: 

Ser Tyr Lys i le Il e Asn Gly Lys His Pne Tyr Pne Asn Asn Asp Gly 

10 15 
Val Met Cl „ uu Gl y val Phe L yS aiy Pro Asp aiy Phe oiu Tyr Phe 

Ala Pro Ala Asn Thr Gin Asn Asn Asn Xle Glu Gly Gin Ala He Val 

Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly Lys Lya Tyr p ^ 

ib 60 

■ ASH ASR ^3 A ^ ^ ^.^ ^ ^ ne a ^ ^ Lyg 

Tyr Tyr Phe Asn Pro Asn Asn Ala He Ala Ala Val ciy Leu Gin llx 



90 



105 110 



He Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala He He Ser 

Ly- Qly Trp Gin Thr Val Asn Gly Ser Ar g Tyr Tyr Phe Asp Thr Asp 

±<zu 125 

Thr Ala He Ala Phe Asn Gly Tyr Lys Thr He Asp Gly Lys His Phe 

135 140 
Tyr Phe Asp Ser Asp Cy S Val Val L ys He Gly Val Phe Ser Thr 



150 



155 



Ser 
160 



Asn Gly p he Glu Tyr Phe Ala Pro Ala Asn Thr T yr Asn Asn Asn lie 



170 



175 



Glu «y Gin Ala He Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn Gly 

185 190 

Lys Ly S Tyr Tyr Phe Asp Asn Asn Ser L y s Ala Val Thr Gly Leu Gin 

«00 205 

Thr He Asp S «r Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu Ala 

Ji5 220 

Ala Thr Gly Trp Gin Thr He Asp Gl y L yS Lys Tyr Tyr Phe Asn Thr 



35 240 
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Asn Thr Ala Glu Ala Ala Thr.Gly Trp Gin Thr He Asp Gly Lvs Lvs 
245 250 255 

Tyr Tyr Phe Asn Thr Asn Thr Ala lie Ala Ser Thr Gly Tyr Thr He 
260 265 I 270 

He Ash Gly Lys His Phe Tyr Phe Asn Thr Asp Gly He Met Gin He 
275 280 285 

Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn 
290 295 300 

Thr Asp Ala Asn Asn lie Glu Gly Gin Ala He Leu Tyr Gin Asn Glu 
305 310 315 320 

Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser Lvs 
325 330 335 

Ala Val Thr Gly Trp Arg lie lie Asn Asn Lys Lys Tvr Tyr Phe Asn 
340 345 35o 

Pro Asn Asn Ala He Ala Ala He His Leu Cvs Thr He Asn Asn Asn 
355 360 * 365 

Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tyr He Thr 
370 375 380 

He Giu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys Met 
385 390 395 4 00 

Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro 
405 410 41.5 

Ala Asn Thr His Asn Asn Asn He Glu Gly Gin Ala lie Vdl Tyr Gin 
420 425 430 

Asn Lys Phe Leu Thr Leu Asn Gly Lys Lvs Tyr Tyr Phe Asp Asn Asp 
435 440 445 

Ser Lys Ala Val Thr Gly Trp Gin Thr He Asp Gly Lys Lvs Tyr Tvr 
450 455 460 * 

Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr lie Asp 
46S 470 475 480 

Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly 
485 490 495 

Trp Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Phe 
500 505 sio 

He Ala Ser Thr Gly Tyr Thr Ser lie Asn Gly Lys His Phe Tvr Phe 
515 520 S25 

Asn Thr Asp Gly lie Met Gin He Gly Val Phe Lys Gly Pro Asn Glv 
530 535 540 

Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn ll~ Glu Glv 
545 550 555 ~ 5 60 

Gin Ala He Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Glv Lvs Lvs 
565 570 575 

Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr He 
580 S 85 590 

Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val Thr 
555 600 605 
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Gly xrp Gin Thr He Asn aiy Lys . Lys Tyr Tyr Asn Tnr ftsn Thr 

Ser xie Ala Ser Thr oly r yr Thr Ile Ile ^ Gly Lyg ^ 
P he Asn Thr Asp ^ Ile Met Gln ^ .[[ ^ ^ g ^ 

Oly Phe ci„ ryr P he Ala Pro Ala A| „ Thr ^ ^ ^ ^ ^ ^ 

Ht T ^ U1 " ^ — "I. 0 Asp 

685 

»e Tyj Tyr Phe Gly Asn A| „ ser Lys flla Ma ^ v ^ ^ 

lie Asp Gl y As „ Arg Tyr Tyr phe Glu ^ A ^ 

Asn Giy Tyr Lys Thf Ile ^ ^ Lys Asn Phe Tyr Phe Arg ^ Qly 

Leu Pro Gln lit Gly val Phe ^i ser Asn Gly phe ^ «* 

Ala Pro Ala Asn Thr Asp Ala Asn Asn lle Clu Oly «„ Z n e Ar 9 
Tyr «i„ Asn Arg phe Leu l ^ t ^ 

7 /5 780 Y 

Wa.„ Ser L ys Ala val T hr Gly Trp Gln ^ Jle ^ ciy ^ ^ 

Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala 
805 810 
2) INFORMATION FOR SEQ ID NO ; 8 : 



795 800 



(l) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 91 ammo acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

Ui) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ser Tyr Lys He lie Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly 

Val Met Gln Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe 

Ala P,o Ala Asn Thr Gln Asn Asn Asn lie Glu Gly Gln Ala He Val 

* u 45 
Tyr «„ ser L ys Phe Leu Thr Leu Asn Gly Lys Lys Tyr T/r phe 

^ b 60 
Asn Asn Ser L ys Ala Val Thr ciy Trp Ar g lie lie Asn Asn Gl u Lys 

Tyr Tyr Phe Asn Pro Asn Asn Ala lie Aia Aia 



80 



85 90 
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10 



20 



40 



50 



60 



65 



<2> INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 7101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA (genomic) 

(ix> FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 7098 



1:1 SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



ATG AGT TTA GTT AAT AGA AAA CAG TTA GAA AAA ATG GCA AAT GTA AGA 
Met Ser Leu Val Asn Arg Lys Gin Leu Glu Lys Met Ala Asn Val Arq 
1 5 10 - - 



15 



TTT CGT ACT CAA GAA GAT GAA TAT GTT GCA ATA TTG GAT GCT TTA GAA 
Phe Arg Thr Gin Glu Asp Glu Tyr Val Ala lie Leu Asp Ala Leu Glu 
20 25 30 

GAA TAT CAT AAT ATG TCA GAG AAT ACT GTA GTC GAA AAA TAT TTA AAA 
Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lvs Tvr Leu Lys 
^5 40 45 



45 AAT TAT ATA AAT CAA TGG AAA GAT GTA AAT AGT GAT TAT AAT GTT AAT 

Asn Tyr lie Asn Gin Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn 
115 120 125 



GTA GTA GAA TCA GCA ATA AAT GAT ACA CTT GAA TCA TTT AGA GAA AAC 
55 Ser Ala Hi Asn As P Thr Leu G * u Ser phe Arg Glu Asn 



CAA AGA GAA GAA AAT CCT GAA CTT ATA ATT GAT GAT ATT GTA AAC ACA 
Gin Arg Glu Glu Asn Pro Glu Leu He He Asp Asp lie Val Lys Thr 
195 200 205 



48 



96 



144 



TTA AAA GAT ATA AAT AGT TTA ACA GAT ATT TAT ATA GAT ACA TAT AAA 19- 
Leu Lys Asp He Asn Ser Leu Thr Asp He Tyr He Asp Thr Tyr Lys 



240 



55 60 

AAA TCT ; GGT AGA AAT AAA GCC TTA AAA AAA TTT AAG GAA TAT CTA GTT 
Lys Ser Giy Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val 
65 70 75 80 

ACA GAA GTA TTA GAG CTA AAG AAT AAT AAT TTA ACT CCA GTT GAG AAA 2B8 
Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lvs 
85 90 95 

AAT TTA CAT TTT GTT TGG ATT GGA GGT CAA ATA AAT GAC ACT G^T ATT Tl* 
Asn Leu His Phe Val Trp He Gly Giy Gin He Asn Asp Thr Ala He 
100 105 no 



384 



GTT TTT TAT GAT AGT AAT GCA TTT TTG ATA AAC ACA TTG AAA AAA ACT 432 
Val Phe Tyr Asp Ser Asn Ala Phe Leu He Asn Thr Leu Lvs Lys Thr 
130 135 140 



480 



TTA AAT GAC CCT AGA TTT GAC TAT AAT AAA TTC TTC AGA AAA CGT ATG 528 
Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met 
165 170 175 

GAA ATA ATT TAT GAT AAA CAG AAA AAT TTC ATA AAC TAC TAT AAA GCT 
Glu He He Tyr Asp Lys Gin Lys Asn Phe He Asn Tyr Tyr Lys Ala 
180 185 190 



576 



624 
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' SISI S !il S5 « s £ - - « & s „ gj 

■ffi a; sj S2 s k'k s sg Is ai sir 
a; as e as at is & - ss a; ™ s ^ k & S 

250 255 

3J StS US SI? SK. » ™ - ™. E K s E « ™ ™ 
iS - ES2!" SK«s- ™™ 51 2S 
S2 E S S5 S SK S JS E K SK S S as K s 
S SS £S; s; E - - k £ S E «j - ;p 

.« ~ - i ^ - - as si 5 s a a: 5 
si? s as as' ai sir a: x s £ « - - a S e 

345 350 

- - Is - k - - £ e ss si aj £ si? «s k- 
s s a as sir s s m si s £ 5 ?n s: is 

J/i> 380 

CAA GGG CTA ATT TCT GTG AAA GAC TCA TAT TTT *nn ^ 
OXn « y Leu ix. Ser v.X Lys Asp s « ™ ™ £J JJ* ™ ATA CTA 

395 400 

k/sk s as S2-25 5? s: SIS is K ™ - - 

410 415 
GCT ATT AGC GAG GAT AAT GAT TTT AAT APT nrn a ™ 

AXa xx. Ser oiu Asp Asn Asp 7»l S S S? ^ S ^ 



425 430 



a; si Hi ™ - - ?j - a; si ss £ si - si? 

440 445 



SK SS £ ?£ Hi 2| si - ™ ™ » -J = n s 

s k as s !z ss s as a: ?s a: s ai ;si as js 

475 4fiO 



672 



7 20 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 
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TTA TTA ATG TTT AAA GAA GGC AGT ATG AAT ATC CAT TTG ATA GAA GCT 1488 
Leu Leu Met Phe Lys Glu Gly Ser Met Asn lie His Leu lie Glu Ala 
485 490 495 

GAT TTA AGA AAC TTT GAA ATG TCT AAA ACT AAT ATT TCT IcAA TCA ACT 1536 
Asp Leu Arg Asn Phe Glu lie Ser Lys Thr Asn lie Ser Gin Ser Thr 
500 505 510 

GAA CAA GAA ATG GCT AGC TTA TGG TCA TTT GAC GAT GCA AGA GCT AAA 1584 
Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys 
515 520 525 



GCT CAA TTT GAA GAA TAT AAA AGG AAT TAT TTT GAA GGT TCT CTT GGT 1632 
Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly 
1> 530 535 540 



GAA GAT GAT AAT CTT GAT TTT TCT CAA AAT ATA GTA GTT GAC AAG GAG 1680 
Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn lie Val Val Asp Lys Glu 
545 550 555 560 

TAT CTT TTA GAA AAA ATA TCT TCA TTA GCA AGA AGT TCA GAG AGA GGA 1728 
Tyr Leu Leu Glu Lys lie Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly 
565 570 575 



-3 TAT ATA CAC TAT ATT GTT CAG TTA CAA GGA GAT AAA ATT AGT TAT GAA 17 76 

Tyr He His Tyr lie Val Gin Leu Gin Gly Asp Lys He Ser Tyr Glu 
580 585 590 



GCA GCA TGT AAC TTA TTT GCA AAG ACT CCT TAT GAT AGT GTA CTG TTT 1824 
Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe 
595 600 605 



CAG AAA AAT ATA GAA GAT TCA GAA ATT GCA TAT TAT TAT AAT CCT GGA 18 72 

Gin Lys Asn lie Glu Asp Ser Glu lie Ala Tyr Tyr Tyr Asn Pro Gly 
->> 610 615 620 



GAT GGT GAA ATA CAA GAA ATA GAC AAG TAT AAA ATT CCA AGT ATA ATT 192 0 

Asp Gly Glu lie Gin Glu He Asp Lys Tyr Lys He Pro Ser He lie 
625 630 635 640 

TCT GAT AGA CCT AAG ATT AAA TTA ACA TTT ATT GGT CAT GGT AAA GAT 196 8 

Ser Asp Arg Pro Lys lie Lys Leu Thr Phe He Gly His Gly Lys Asp 

645 650 655 



4> GAA TTT AAT ACT GAT ATA TTT GCA GGT TTT GAT GTA GAT TCA TTA TCC . 

Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
66 0 66 5 6 70 



>016 



ACA GAA ATA GAA GCA GCA ATA GAT TTA GCT AAA GAG GAT ATT TCT CCT 2 06 4 

>0 Thr Glu He Glu Ala Ala He Asp Leu Ala Lys Glu Asp He Ser Pro 

675 680 685 

AAG TCA ATA GAA ATA AAT TTA TTA GGA TGT AAT ATG TTT AGC TAC TCT 2112 

Lys Ser He Glu He Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser 

^ , 690 695 700 

ATC AAC GTA GAG GAG ACT TAT CCT GGA AAA TTA TTA CTT AAA GTT AAA 2160 

He Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lvs Val Lys 
7 <>5 710 715 " 7201 



GAT AAA ATA TCA GAA TTA ATG CCA TCT ATA AGT CAA GAC TCT ATT ATA 2 2 08 

Asp Lys lie Ser Glu Leu Met Pro Ser He Ser Gin Asp Ser He He 
725 730 735 



63 GTA AGT GCA AAT CAA TAT GAA GTT AGA ATA AAT AGT GAA GGA AGA AGA 2 2 56 

Val Ser Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly Arg Arg 
740 745 750 
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915 " lyr Ala Asn Hi s He Thr Glu Glu 

925 

111 IS £ys lie i£ £y JS JS E f T GTA ^ AAG TTA 

930 , Y y ™5 Ile Phe As P Thr Val Asn Gly Lys Leu 

1 s " 940 
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985 99Q 

leu Phi ser ClJ ?™ GAT GCA GCC ^ GTT GTT 

j-uuo 1005 



2448 



2496 



2 54 4 



2592 



2640 



2688 



2736 



GAG 2784 



2832 



2880 



2928 
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GAA TTA GTA TCA ACT GCA TTA GAT GAA ACT ATA GAC TTA CTT CCT ACA 
Glu Leu Val Ser Thr Ala Leu Asp Glu Thr lie Asp Leu Leu Pro Thr 
1010 1015 1020 

TTA TCT GAA GGA TTA CCT ATA ATT GCA ACT ATT ATA GAtI GGT GTA AGT 
Leu Ser Glu Gly Leu Pro He lie Ala Thr lie He Asp Gly Val Ser 
1025 1030 1035 104O 

TTA GGT GCA GCA ATC AAA GAG CTA AGT GAA ACG AGT GAC CCA TTA TTA 
Leu Gly Ala Ala He Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 
1045 1050 1055 



1070 



3072 



3120 



3168 



AGA CAA GAA ATA GAA GCT AAG ATA GGT ATA ATG GCA GTA AAT TTA ACA 3^16 
Arg Gin Glu He Glu Ala Lys lie Gly He Met Ala Val Asn Leu Thr 
1 ^ 1O60 1065 



3264 



ACA GCT ACA ACT GCA ATC ATT ACT TCA TCT TTG GGG ATA GCT AGT GGA 
Thr Ala Thr Thr Ala He lie Thr Ser Ser Leu Gly He Ala Ser Glv 
1075 1080 108S 

TTT AGT ATA CTT TTA GTT CCT TTA GCA GGA ATT TCA GCA GGT ATA CCA 3312 
Phe Ser He Leu Leu Val Pro Leu Ala Gly He Ser Ala Gly lie Pro 
1090 1095 iioo 

AGC TTA GTA AAC AAT GAA CTT GTA CTT CGA GAT AAG GCA ACA AAG GTT 3 360 

Ser Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lvs Val 
1105 IHO 1115 1120 

GTA GAT TAT TTT AAA CAT GTT TCA TTA GTT GAA ACT GAA GGA GTA TTT 3408 
Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 
ll 25 1130 H35 

ACT TTA TTA GAT GAT AAA ATA ATG ATG CCA CAA GAT GAT TTA GTG ATA 
Thr Leu Leu Asp Asp Lys He Met Met Pro Gin Asp Asp Leu Val He 
• H40 H45 iiso 

TCA GAA ATA GAT TTT AAT AAT AAT TCA ATA GTT TTA GGT AAA TCT GAA 
Ser Glu He Asp Phe Asn Asn Asn Ser He Val Leu Glv Lys Cvs Glu 
II 55 H60 1165 

ATC TGG AGA ATG GAA GGT GGT TCA GGT CAT ACT GTA ACT GAT GAT ATA 
He Trp Arg Met Glu Gly Gly. Ser Gly His Thr Val Thr Asp Asp He 
1170 1175 H80 

4:> GAT CAC TTC TTT TCA GCA CCA TCA ATA ACA TAT AGA GAG CCA CAC TTA 

Asp His Phe Phe Ser Ala Pro Ser He Thr Tyr Arg Glu Pro Hi*? Leu 
1185 H90 H9S " 1200 

TCT ATA TAT GAC GTA TTG GAA GTA CAA AAA GAA GAA CTT GAT TTG TCA 364 8 

W Ser He Tyr Asp Val Leu Glu Val Gin Lys Glu Glu Leu Asp Leu Ser 

1205 1210 1215 

AAA GAT TTA ATG GTA TTA CCT AAT GCT CCA AAT AGA GTA TTT GCT TGG 36 96 

Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp 
X \ l 220 1225 1230 

GAA ACA GGA TGG ACA CCA GGT TTA AGA AGC TTA GAA AAT GAT GGC ACA 3744 
Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Gly Thr 
1235 1240 1245 

AAA CTG TTA GAC CGT ATA AGA GAT AAC TAT GAA GGT GAG TTT . TAT TGG 
Lys Leu Leu Asp Arg He Arg Asp Asn Tyr Glu Gly Glu Phe Tvr Trp 
1250 1255 126O ' 

AGA TAT TTT GCT TTT ATA GCT GAT GCT TTA ATA ACA ACA TTA AAA CCA 
Arg Tyr Phe Ala Phe He Ala Asp Ala Leu He Thr Thr Leu Lys Pro 
1265 1270 1275 1280 



3456 



3504 



3552 



3600 



3792 



3840 
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AAT AGT GTG CAT TTA GAT GAA AGT GGA GTA GCT GAG ATT TTG AAG TTC 
Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu He Leu Xys Phe 
1555 1560 1565 

ATG AAT AGA AAA GGT AAT ACA AAT ACT TCA GAT TCT TTA ATG AGC TTT 
Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe 
1570 1575 1580 

TTA GAA AGT ATG AAT ATA AAA AGT ATT TTC GTT AAT TTC TTA CAA TCT 
Leu^Glu Ser Met Asn He Lys Ser He Phe Val Asn Phe Leu Gin Ser 
1585 1590 1595 1600 

AAT ATT AAG TTT ATA TTA GAT GCT AAT TTT ATA ATA AGT GGT ACT ACT 
Asn lie Lys Phe He Leu Asp Ala Asn Phe He He Ser Gly Thr Thr 
1605 i6io 1615 

TCT ATT GGC CAA TTT GAG TTT ATT TGT GAT GAA AAT GAT AAT ATA CAA 
Ser He Gly Gin Phe Glu Phe He Cys Asp Glu Asn Asp Asn He Gin 
1620 1625 1630 

CCA TAT TTC ATT AAG TTT AAT ACA CTA GAA ACT AAT TAT ACT TTA TAT 
Pro Tyr Phe He Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tvr 
1635 1640 1645 . 

GTA GGA AAT AGA CAA AAT ATG ATA GTG GAA CCA AAT TAT GAT TTA GAT 
Val Gly Asn Arg Gin Asn Met lie Val Glu Pro Asn Tyr Asp Leu Asp 
1650 1655 1660 

GAT TCT GGA GAT ATA TCT TCA ACT GTT ATC AAT TTC TCT CAA AAG TAT 
Asp Ser Gly Asp He Ser Ser Thr Val He Asn Phe. Ser Gin Lvs Tvr 
1665 ^70 1675 ' xJao 

CTT TAT GGA ATA GAC AGT TGT GTT AAT AAA GTT GTA ATT TCA CCA AAT 
Leu Tyr Gly He Asp Ser Cys Val Asn Lys Val Val He Ser Pro Asn 
1685 1690 1695 

ATT TAT ACA GAT GAA ATA AAT ATA ACG CCT GTA TAT GAA ACA AAT AAT 
He Tyr Thr Asp Glu He Asn He Thr Pro Val Tyr Glu Thr Asn Asn 
1 7 °0 1705 1710 

ACT TAT CCA GAA GTT ATT GTA TTA GAT GCA AAT TAT ATA AAT GAA AAA 
Thr Tyr Pro Glu Val He Val Leu Asp Ala Asn Tyr He Asn Glu Lys 
1715 1720 1725 

ATA AAT GTT AAT ATC AAT GAT CTA TCT ATA CGA TAT GTA TGG AGT AAT 
He Asn Val Asn He Asn Asp Leu Ser He Arg Tyr Val Trp Ser Asn 
1730 1735 1740 

GAT GGT AAT GAT TTT ATT CTT ATG TCA ACT AGT GAA GAA AAT AAG GTG 
Asp Gly Asn Asp Phe He Leu Met Ser Thr Ser Glu Glu Asn Lys Val 
1745 17 50 1755 X i 760 

TCA CAA GTT AAA ATA AGA TTC GTT AAT GTT TTT AAA GAT AAG ACT TTG 
Ser Gin Val Lys He Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu 
1765 1770 1775 

GCA AAT AAG CTA TCT TTT AAC TTT AGT GAT AAA CAA GAT GTA CCT GTA 
Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gin Asp Val Pro Val 
1780 1785 1790 

AGT GAA ATA ATC TTA TCA TTT ACA CCT TCA TAT TAT GAG GAT GGA TTG 
Ser Glu He He Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu 
1795 1800 1805 

ATT GGC TAT GAT TTG GGT CTA GTT TCT TTA TAT AAT GAG AAA TTT TAT 
He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lvs Phe Tvr 
181 ° 1815 i 8 20 ' * 



4704 



4752 



4800 



4648 



4896 



4944 



4992 



5040 



5088 



51.36 



5184 



5232 



5280 



5328 



5376 



5424 



547; 



- 247 - 



10 



20 



25 



40 



45 



65 



WO *^ ? PCT/US97/1S394 



?S iE ™ 12 £ A jg'ss' S!?.Jg S5 2T A j* TAT ATT ««■ 

18-25 y 1830 y teU Ile Tyr Ile Asn Asp 

1835 184' 

» ESJ. ??I SI £ 52 S SS £ !il S K £ SI 

1850 1855 



ss S SI? ^ ss ^ - ~ ~ m « „ « 

1865 1870 



aiI s: s A n - a «s - A r - t ™ ™ 

1875 ibbo P SP Lys Asn ^ T yr Phe 

1880 1885 

25 §£21 SS 5! ™ aj t s s; - - » s «. «, « 
S S ^ SI S SE Eg 2.^.2 « « 
.Si S SI XS E S SJ Hi £™ SK j£ ?n ~ 
- SI 2* S£ 21 - s K S £ SJ S'-s 

M £? 2i gg si si s s ss ~ sj ^ s si°^ sj 

1960 - 1965 



a si s; ts aj s£ %p - - E « E « « 

Ay/b 1980 

£2 S£ S: S£ ^ A I A ^ ^ AAA CAC TAT TTT GAT 

198 5 * . if 90 P ASn LyS His T >' r Asp 

1995 2O00 



2040 20 J 5 

^ Si SJ Si ?n E ^ S « ™ ™ - ™ jj j jjj 

60 fc055 2060 



2080 



5520 



5568 



5616 



5712 



5760 



5808 



5856 



5904 



5952 



6000 



6048 



ss s s; sn s £ ss sj ss k si s; si 

» SI ™ SI M KS SK- KM SJ S SI 
, Si til SI - - J" s SI SI i= Si 2J SgJ S 



6144 



6192 



AAA ATT TAC TAT TTT GAT GAT TCA TTT ACA GCT GTA G-T GGA Trr 
= 6l 5 lie PhS ^? 0 AS P S - »» ".r Ala 5 vI A £J J« ^ 
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Asp ,eu oiu Asp oiy s « ,ys ^ Tyr S ^ ™ Thr S ^ " 88 
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GCA TAT ATA GGT TTG TCA TTA ATA AAT GAT GGT CAA TAT TAT TTT AAT 63 36 

Ala Tyr lie Gly Leu Ser Leu lie Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

GAT GAT GGA ATT ATG CAA GTT GGA TTT GTC ACT ATA AAT^ GAT AAA GTC 
Asp Asp Gly lie Met Gin Val Gly Phe Val Thr lie Asn Asp Lys Val 
2115 2120 212S 



-5 AG A GTT GGG GAA GAT GTA TAT TAT TTT GGA GAA ACA TAT ACA ATT GAG 

Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr He Glu 
2195 2200 2205 



4^ TTT GGT TAT ATA AAT ATA GAA GAT AAG ATG TTC TAT TTT GGT GAA GAT 

Phe Gly Tyr He . Asn lie Glu. Asp Lys Met Phe Tyr Phe Gly Glu Asp 
2275 2280 2285 



AAC TAT ACT GGT TGG TTA GAT TTA GAT GAA AAG AGA TAT TAT TTT ACA 
Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tvr Tyr Phe Thr 
2325 2330 * 2335 



TAG 



6364 



TTC TAC TTC TCT GAC TCT GGA ATT ATA GAA TCT GGA GTA CAA AAC ATA 64 32 

Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser Gly Val Gin Asn He 
2130 2135 2140 



GAT GAC AAT TAT TTC TAT ATA GAT GAT AAT GGT ATA GTT CAA ATT GGT 64 8 0 

Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly He Val Gin He Glv 
! ~ 2145 2150 2155 2160 

GTA TTT GAT ACT TCA GAT GGA TAT AAA TAT TTT GCA CCT GCT AAT ACT 6 528 

Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
2165 2170 2175 



GTA AAT GAT AAT ATT TAC GGA CAA GCA GTT GAA TAT AGT GGT TTA GTT 6 5 76 

Val Asn Asp Asn He Tyr Gly Gin Ala Val Glu Tyr Ser Gly Leu Val 
2180 2185 2190 



6624 



ACT GGA TGG ATA TAT GAT ATG GAA AAT GAA AGT GAT AAA TAT TAT TTC 66 7 2 

Thr Gly Trp He Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tvr Phe 
2210 2215 2220 



AAT CCA GAA ACT AAA AAA GCA TGC AAA GGT ATT AAT TTA ATT GAT GAT 6 72 0 

Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly He Asn Leu He Asp Asp 
^ 2225 2230 2235 2240 



ATA AAA TAT TAT TTT GAT GAG AAG GGC ATA ATG AGA ACG GGT CTT ATA 6 76 8 
He Lys Tyr Tyr Phe Asp Glu Lys Gly He Met Arg Thr Gly Leu He 
2245 2250 2255 

TCA TTT GAA AAT AAT AAT TAT TAC TTT AAT GAG AAT GGT GAA ATG CAA 6 816 
Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gin 
2260 2265 2270 



6864 



GGT GTC ATG CAG ATT GGA GTA TTT AAT ACA CCA GAT GGA TTT AAA TAC 6 912 

Gly Val Met Gin He Gly Val Phe Asn Thr Pro Asp Gly Phe Lys Tvr 
2290 2295 2300 



TTT GCA CAT CAA AAT ACT TTG GAT GAG AAT TTT GAG GGA GAA TCA ATA 6 96 0 

Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser He 
3> 2305 2310 2315 2320 
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GAT GAA TAT ATT GCA GCA ACT GGT TCA GTT ATT ATT GAT GGT GAG GAG 7 056 

Asp Glu Tyr He Ala Ala Thr Gly Ser Val He He Asp Gly Glu Glu 
2340 2345 2350 

TAT TAT TTT GAT CCT GAT ACA GCT CAA TTA GTG ATT AGT GAA 7 098 

Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu Val He Ser Glu 
2355 2360 2365 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2366 amino acids 
IB) TYPE: amino acid 
<D) TOPOLOGY : linear 

Ui) MOLECULE TYPE; protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:10. 

Me. Ser Leu Val Asn Arg Lys Gln Leu Glu Lys Met Ala Asn ftrg 

Phe Arg Thr ci n eia Asp Glu Tyr „ ;J ^ ^ ^ ^ J* ^ 

olu Tyr His Asn Met ser Glu ^ Thr val Lys Z Leu Lys 

Leu Lys Asp Ile Asn ser Leu Thr Asp 1Xe ryr He Jp Thr Tyr Lys 

6 0 

LX| Ser Gly Arg Asn Lys Ala Leu Lys Lys Ph f Lys Glu Tyr Leu ^ 

Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr prQ ^ ^ £ 

90 95 
Asn Leu His P he vai Trp Xle Cly Gly Gin „. Asn Asp ^ Ala 

Asn Tyr a. Asn Gin Trp Lys As p v.l Asn Ser Asp Tyr Asn Val Asn 

12 5 

val Phe Tyr A5p ser Asn ^ phe ^ ^ ^ ^ 

val val Olu ser Ala lie Asn Asp Th r Le u Glu Ser Phe Arg Glu 
Leu Asn Asp Pro Arf Pne Asp Tyr Asn Lys phe phe ftrg ^ ^ ^ 

Clu He lie Tyr Asp Lys Gin Lys Asn Pne He Aan Tyr ^ ^ ftl 

185 190 
Gin Ar 9 alu Glu Asn Pro Clu Leu He Xle Asp Asp Xl e v .l Lys Thr 

u 205 

Tyr Leu Ser Asn Glu Tvr Ser r r-i,, tt 

210 Y LyS Glu Ile As P Glu Leu Asn Thr Tyr 

220 7 

lie Glu Clu Ser Leu Asn Lys He Thr Gin Asn Ser cay Asn Asp v.l 

Arg Asn Phe clu Glu Phe Lys Asn Gl y Glu Ser Phe Asn Leu Tyr Z 

Cln Glu Leu Val Glu Ar g Trp Asn Leu Ala Ala Ala Ser Asp xil Leu 

Arg lie Ser Ala Leu Lys Glu lie Gly Gly Met Tyr Leu Asp val Asp 

Met Leu Pro Gly He Gin Pro Asp Leu Pne Glu Ser xle Glu Lys Pro 

Jyb 300 
Ser ser Val Thr Val Asp Pne Trp Glu Met Thr Lys _ ^ ^ ^ 
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Met Lys Tyr Lys Glu Tyr lie Pro Glu Tyr Thr Ser Glu His Phe Asp 
325 . 330 335 

Met Leu Asp Glu Glu Val Gin Ser Ser Phe Glu Ser Val Leu Ala Ser 
340 345 3SO 

Lys Ser Asp Lys Ser Glu lie Phe Ser Ser Leu Gly Asp Met Glu Ala 
355 360 365 

Ser Pro Leu Glu Val Lys lie Ala Phe Asn Ser Lys Gly lie He Asn 
37 <> 375 380 

Gin Gly Leu He Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu lie Val 
385 390 395 400 

Lys Gin He Glu Asn Arg Tyr Lys He Leu Asn Asn Ser Leu Asn Pro 
405 410 415 

Ala He Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe He 
420 425 430 

Asp Ser He Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met 
435 440 445 

Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr 
450 455 460 

Thr He Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gin Asp 
465 470 475 480 

Leu Leu Met Phe Lys Glu Gly Ser Met Asn He His Leu He Glu Ala 
485 490 495 

Asp Leu Arg Asn Phe Glu He Ser Lys Thr Asn He Ser Gin Ser Thr 
500 505 510 

Glu Gin Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lvs 
515 520 525 

Ala Gin Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly 
530 535 540 

Glu Asp Asp Asn Leu Asp Phe Ser Gin Asn He Val Val Asp Lys Glu 
545 550 555 560 

Tyr Leu Leu Glu Lys He Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly 
565 570 575 

Tyr He His Tyr He Val Gin Leu Gin Gly Asp Lys He Ser Tyr Glu 
580 585 590 

Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Ph^ 
595 600 605 

Gin Lys Asn He Glu Asp Ser Glu He Ala Tyr Tyr Tyr Asn Pro Gly 
610 615 620 

Asp Gly Glu He Gin Glu He Asp Lys Tyr Lys He Pro Ser He He 
"5 630 635 640 

Ser Asp Arg Pro Lys He Lys Leu Thr Phe He Gly His Gly Lvs Asp 
645 650 655 

Glu Phe Asn Thr Asp He Phe Ala Gly Phe Asp Val Asp Ser Leu Ser 
660 66S 670 

Thr Glu lie Glu Ala Ala He Asp Leu Ala Lys Glu Asp He Ser Pro 
675 680 685 
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^ til Ue AS " ^ 0l * ^ *«" phe ser Tyr 

* D 700 

lie Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys 
Asp Lys XleSerGlu Leu Met Pro Ser lie Ser Gin Asp Ser xle III 
Val Ser Al a ^ Gln Tyr Glu val ^ ^ ^ 21 Arg 

Glu Leu Leu Asp His Ser G l y Glu Trp Ile Asn Lys Glu Glu Ser He 

/b0 765 
Ue Lys Asp lie S er Ser Lys Glu Tyr U e Ser Phe Asn Pro Lys Glu 

Asn Lys Xle Thr Val Lys Ser Lys Asn Leu Pro Z Leu Ser Thr Leu 

795 800 
Leu Gin Glu Xle Ar f Asn Asn Ser Asn Ser Ser Asp xle clu Leu Glu 

Glu Lys Val Met Leu Thr Glu Cys Glu Xle Asn Val xle Ser Z xle 

Asp Thr Gin He Val Glu Glu ffi Ile Glu Glu Ala Lyg ^ ^ ^ 

««u 845 

Ser Asp ser lie Asn Tyr lie Lys Asp Glu Phe Lys Leu lie Glu Ser 

B " 860 
lie Ser Asp Ala Leu Cys Asp Leu Lys Gin Gin Asn Glu Leu Glu Asp 

Ser „ lS Phe Xle Ser Phe Glu Asp lie Ser Glu Thr Asp Glu Gly Z 

Ser Xle Ar 3 Phe Xle Asn Lys Glu Thr Gly Glu Ser Xle Phe lZ Glu 

905 910 

Thr Glu Lys T hr Xle Phe Ser Glu Tyr Ala Asn His xle Thr Clu Glu 

y^o 925 

Ho ser Lys lie Lys Gly Thr lie Phe Asp Thr Val Asn Gly Lys Leu 

y f 5 940 
Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn 

955 960 
Ala Ala Phe Phe Xle Gin Ser Leu He Glu Tyr Asn Ser Ser Lys Glu 

Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gin Val Tyr 12 Gin 

985 990 

Leu Phe ser Thr Gly Leu Asn Thr xle Thr Asp Ala Ala Lys Val Val 

iouo 1005 



Glu Larval ser Thr Ala Leu Asp Glu Thr Xle Asp Leu Leu Pro Thr 

iUib 1020 

!0^S Ser Gly L6U llto 1 ^ Thr "« Ile A =P ^1 Ser 

1035 1040 
Leu Gly Ala Ala xle Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu 
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Arg Gin Glu lie Glu Ala Lys lie Gly He Met Ala Val Asn Leu Thr 
1060 1065 1070 

Thr Ala Thr Thr Ala He He Thr Ser Ser Leu Gly He Ala Ser Gly 
1075 1080 i 085 1 

Phe Ser He Leu Leu Val Pro Leu Ala Gly He Ser Ala' Gly He Pro 
1090 1095 iioo 

^. r , Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val 
1105 1110 HIS 1120 

Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe 
H2S H30 1135 

Thr Leu Leu Asp Asp Lys He Met Met Pro Gin Asp Asp Leu Val He 
1140 1145 H50 

Ser Glu He Asp Phe Asn Asn Asn Ser He Val Leu Gly Lys Cys Glu 
1155 1160 1165 

lie Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp He 
1170 H75 iiso 

Asp His Phe Phe Ser Ala Pro Ser He Thr Tyr Arg Glu Pro His Leu 
1185 1X 9<> 1195 1200 

Ser He Tyr Asp Val Leu Glu Val Gin Lys Glu Glu Leu Asp Leu Ser 
1205 1210 1215 

Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp 
1220 1225 1230 

Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Glv Thr 
1235 1240 1245 

Lys Leu Leu Asp Arg He Arg Asp Asn Tyr Glu Glv Glu Phe Tyr Trp 
1250 1255 1260 

Arg Tyr Phe Ala Phe He Ala Asp Ala Leu He Thr Thr Leu Lys Pro 
1265 1270 1275 y 1280 

Arg Tyr Glu Asp Thr Asn He Arg He Asn Leu Asp Ser Asn Thr Arg 
1285 1290 1295 

Ser Phe He Val Pro lie He Thr Thr Glu Tyr He Arg Glu Lys Leu 
I 300 1305 1310 

Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser 
1315 1320 1325 

Gin Tyr Asn Met Gly He Asn He Glu Leu Ser Glu Ser Asp Val Trp 
1330 1335 1340 

He lie Asp Val Asp Asn Val Val Arg Asp Val Thr He Glu Ser Asp 
1345 1350 1355 i3 6 o 

Lys He Lys Lys Gly Asp Leu He Glu Gly He Leu Ser Thr Leu Ser 
1365 1370 1375 

He Glu Glu Asn Lys He He Leu Asn Ser His Glu He Asn Phe Ser 
i3 60 1385 1390 

Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser He 
I 395 1400 1405 

Leu Glu Gly He Asn Ala He He Glu Val Asp Leu Leu Ser Lys Ser 
14i0 1415 1420 
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Ty^Lys Leu Leu Il.-J.r.aiy Glu Leu L ys lle^u Met ' Leu Asn r 
Asn Hi, Ile Oln jln^y. Ile Asp Tyr Il^Ol"^ As „ fe , olu^eT 

T Ji.V" Phe ^ 5 -P GXu Gly LysoZ\ sn 

Gly Phe Xle^sn Gly Ser Thr Ly^GIualy Leu Phe -iLTgXu Leu 
Pro j.pv.1 VaX Leu IXe Ser^s Val; Tyr Met As^Ser Lys p „ 
Se^Phe oiy Tyr Tyr' J.rA«„ Asn LeU Lys J-P^.X .Ly- VaX lie Thr 
.ys Asp Asn val ^ „. Leu Thr My Tyr^Leu Lys Asp ' 

Lys lie Ser Uu.er Leu Thr Leu G1 Asp Glu Lyg ^ ^ 

iMb 1550 
rt^S HiS ^ ^ G1U Val A Xa CXu IXe.u Lys Phe . 

Met AsnArg Lys Gly ta . T ^ Asn Thr Ser Asp ^^^^ ^ 

lies 01 " AS " llt^s S-r IXe Phe Val Asn Phe Leu Gln Sfir 

1595 1600 

*" ^' S 5 UB Ma A,B S,"' »« °1V ^Thr 

Ser XXe Gly Gl^Phe Glu Phe XXe Cys^sp CXu Asn Asp A sn^ I Xe ^GXn 

Pro Tyr Phf Ile Lys Phe Asn Thr L.« «„ Thr A =n T y r Thr^Leu Tyr 

1640 1645 

Val Gly Asn Arg Gln Asn Met lie Val Gln pk. ft 

1650 lfi cc aA G1U Pro Asn Asp Leu Asp 

i&b5 1660 

xII 5 Ser Gly A =P Thr VaX lie As^Phe Ser Gln Lys Tyr 

-u Tyr OXy Il. J-j Ser C ys VaX Asn ^ V al Val Ile Ser prQ ^ 

1690 16g5 
He Tyr Thr £poiu XXe Asn XXe Thr Pro Val Tyr Glu Thr Asn ^ 

1/05 1710 
Thr Tyr p„ clu val XXe VaX Leu Asp Ala Asn Tyr IXe Asn GXu Lys 

1 i 1725 
XXe Asn 0 val Asn XXe Asn Asp Ser IXe Arg Tyr Val Trp Ser Asn 

X/Jb 1740 
A3P 5 Gly ABn ASP Phe Il^Leu Met Ser Thr Ser^Glu GXu Asn Lys Val 

Ser Gln VaX Lys II^Arg Phe VaX .sn Va^Phe Lys Asp Lys ^/J" 

AXa Asn Lys Leaser P he Asn Phe Ser Asp Lys Gln As P Val ^^1 

1785 179Q 
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Ser Glu He He Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu 
1795 1800 1805 

He Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu .Lvs Phe Tyr 
1810 1815 1820 1 

He Asn Asn Phe Gly Met Met Val Ser Gly Leu lie Tyr He Asn Asp 
182 5 1830 1835 1840 

Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu He Thr Gly Phe 
1845 1850 1855 

Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro He Asn Gly Gly 
1860 1865 1870 

Ala Ala Ser He Gly Glu Thr He He Asp Asp Lys Asn Tyr Tyr Ph*» 
1875 1880 1885 

Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe Ser Thr Glu Asp Gly 
1890 1895 1900 

Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp. Glu Asn Leu Glu Gly 
1905 1910 1915 1920 

Glu Ala lie Asp Phe Thr Gly Lys Leu He He Asp Glu Asn He Tyr 
1925 1930 1935 

Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu Trp Lys Glu Leu Asn 
1940 1945 1950 

Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lvs Gly 
1955 i960 1965 

Leu Asn Gin He Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val 
1970 1975 1980 

Met Gin Lys Gly Phe Val Ser He Asn Asp Asn Lys His Tyr Phe Asp 
1985 1990 1995 2000 

Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu He Asp Gly Lys His 
2005 2010 2015 

Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin He Gly Val Phe Asn Thr 
2020 2025 2030 

Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Glv Asn 
2035 2040 2045 

Glu Glu Gly Glu Glu He Ser Tyr Ser Gly He Leu Asn Phe Asn Asn 
2050 2055 2060 

Lys He Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Trp Lys 
2065 2070 2075 2080 

Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu 
2085 2090 2095 

Ala Tyr He Gly Leu Ser Leu He Asn Asp Gly Gin Tyr Tyr Phe Asn 
2100 2105 2110 

Asp Asp Gly He Met Gin Val Gly Phe Val Thr Ho Asn Asp Lvs Vai 
2115 2120 2125 

Phe Tyr Phe Ser Asp Ser Gly He J le Glu Ser Gly Val Gin Asn He 
2130 21^.5 2140 

Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn Gly He Val Gin He Gly 
2145 2150 2155 2160 

Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr 
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Val Asn Asp Asn lie Tyr Gly Gin Ala Va1 ,~i „ 

2180 y ° In aJ" Val Glu T Y* Ser Gly Le U Val 

" 9 "* l 2?."" A ' p T - SS.- v- sj* 

Thr s^o^ T - «~ « ~ «. fc 
Ss p» ... ™, , y « g. „. cyo , y , QIy u> ^ ^ 

», ^v. T„r Tyi - jj.^ Qlu Lys oly ^ ^ »« 

2255 

Ser Phe Glu Asn Asn Asn Tvr t,^ r,v. 

2260 Tyi Tyr G1 " A S n Gly Glu Met G l n 

" 65 2270 

" h " ° ly I£s"* 5° 0 <-y «« «« w olu A , p 

••r «„ „. „ y » , ■ Thr pre ™" 

" 2 3 00 ' y 

«J «• «». a,, As „ ^ Asp 0iu ^ n ^ 0( (h ^ 

«. ^ Tnr „ y A=p ^ s , p t ^ ^ ^ ^ 

«P «» Ty, „. „, Ala T1>r „„ „ Uo u . A=p ^ 

^ J4b 2350 
Tvr xyr Ph^Asp Pro As p Thr ^ ^ ^ ^ ^ 



2360 2365 



* " > ^FORMATION FOR SEQ ID NO: 11 : 

ii) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 19 base pairs 
<B> TYPE: nucleic acid 
4S (C> STRANDEDNESS : single 

"» TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

50 (Xi) SEQUENCE DESCRIPTION : SEQ ID NO:ll ; 

TAGAAAAAAT GGCAAATGT 

.(2) INFORMATION FOR SEQ ID NO: 12: 
~"° (i> SEQUENCE CHARACTERISTICS 

(A) LENGTH: 21 base pairs 
<B) TYPE : nucleic acid 
(C) STRANDEDNESS: single 
6() (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTCATCTTG TAGACTCAAA G 
<2) INFORMATION FOR SEQ ID NO: 13: 

7() (i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
GATGCCACAA GATGATTTAG TG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE; nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

CTAATTGAGC TGTATCAGGA TC 

(2) INFORMATION FOR SEQ ID NO: 15: 

<ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGAATTCCT AGAAAAAATG GCAAATG 

(2) INFORMATION FOR SEQ ID NO : 16 : 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GCTCTAGAAT GACCATAAGC TAGCCA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

CGGAATTCGA GTTGGTAGAA AGGTGGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
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(B) TYPE: nucleic acid 

<C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO* 18: 
CGGAATTCGG TTATTATCTT AAGGATG 
f2> INFORMATION FOR SEQ ID NO: 19: 



PCT/US97/15394 



35 



40 



45 



50 



55 



65 



70 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 
' Leu Ile Tnr Gly ?e vai ^ ^ ^ ^ ^ ^ 

Pro He Asn Gly Gly Ala Ala Ser lie Gly Glu Thr lie lie Asp 

LYS Asa Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Gly Val Phe 

* u 45 
ser Thr Glu Asp Gly phe ^ phe ^ ^ ^ ^ ^ ^ 

bb 60 K 

Glu Asn Leu Glu Gly Glu Ala lie Asp Phe Thr Gly Lys Lea He U e 

75 80 
Asp Glu Asn lie Tyr Tyr Phe Asp Asp Asn Tyr Ar 9 Gly Ala Val Glu 

Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly 

LV3 Ala P h| Lys Gly Leu Asn Gin Xle Gly Asp Tyr Lys Tyr Tyr Phe 

Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser lie Asn Asp Asn 

135 140 
His T yr Phe Asp Asp Ser Gly Vai Met Lys Val Gly Tyr Thr Glu 

He Asp Gly Lys jio Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin III 

Gly val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His „ is Asn 
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<i) SEQUENCE CHARACTERISTICS- 
IS '*> LENGTH: 28 base pairs 

< B > TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2() <ii> MOLECULE TYPE: DNA (genomic) 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 19 : 
CGGAATTCTT GATAACTGGA TTTGTGAC 

(2) INFORMATION FOR SEQ ID NO: 20: 2 " 
U) SEQUENCE CHARACTERISTICS- 

!b! typ1 TH: 511 amino acids 

*() " - ■ \o) TYPE : amino acid 

• jC) STRANDEDNESS: unknown 

<D> TOPOLOGY: unknown 
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in 



15 



30 



45 



50 



180 185 



190 



Glu Asp Leu Gly Asri Glu Glu Gly Glu Glu lie Ser Tyr Ser Gly He 
195 200 , 205 

Leu Asn Phe Asn Asn Lys lie Tyr Tyr Phe Asp Asp Ser Phe Thr Ala 
a1 ° 215 220 

Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp 

Glu Asp Thr Ala Glu Ala Tyr He Gly Leu Ser Leu He Asn Asp Glv 

245 25f tr r 



255 



Gin Tyr Tyr Phe Asn Asp Asp Gly H e Met Gin Val Gly Phe Val Thr 
260 265 270 

He Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He He Glu Ser 
275 280 285 

^ I1S ASP ASP ASn Tyr Phe '*Y* Ile A ^P Asp Asn Gly 

295 300 

He Val Gin Ile Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe 

' 310 315 3 20 

Ala Pro Ala Asn Thr Val Asn Asp Asn Ile Tyr Gly Gin Ala Val Glu 
32 5 33 0 335 

Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu 
340 345 350 

Thr Tyr Thr lie Glu Thr Gly Trp lie Tyr Asp Met Glu Asn Glu Ser 
355 360 365 

' Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Glv He 
370 375 380 

40 *{|" Leu Ile As P As P Ile L V S Tyr Tyr Phe Asp Glu Lys Gly He Met 

385 390 395 400 

Arg Thr Gly Leu Ile Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu 
405 4!o 415 

Asn Giy Glu Met Gin Phe Gly Tyr Ile Asn Ile Glu Asp Lvs Met Phe 
420 425 4 - 30 

Tyr Phe Gly Glu Asp Gly Val Met Gin He Gly Val Phe Asn Thr Pro 
43 5 440 445 

Asp Gly Phe Lys Tyr Phe Ala His Gin Asn Thr Leu Asp Glu Asn Phe 
450 455 460 

55 Glu Gly Glu Ser He Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys 

470 475 48O 

Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val He 
465 490 495 

60 Ile * Sp Glu Glu Tyr Phe Asp Pro Asp Thr Ala Gin Lou 

500 505 510 
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(2) INFORMATION FOR SEQ ID NO: 21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Glu Glu Asn Lys Val Ser Gin Val Lys He Arg Phe Val Asn Val 

5 10 15 

Phe Lys Asp Lys Thr Leu Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp 

25 30 

Lys Gin Asp Val Pro Val Ser Glu He He Leu Ser Phe Thr Pro Ser 

Tyr Tyr Glu Asp Gly Leu lie Gly Tyr Asp Leu Gly Leu Val Ser Leu 

55 60 

Tyr Asn Glu Lys Phe Tyr lie Asn Asn Phe Gly Met Met Val Ser Gly 

75 80 
Leu lie Tyr He Asn Asp Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn 



90 



95 



Asn Leu lie Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe 

105 110 

Asn Pro lie Asn Gly Gly Ala Ala Ser He Gly Glu Thr lie He Asp 

120 125 
Asp Lys Asn Tyr Tyr Phe Asn Gin Ser Gly Val Leu Gin Thr Gly Val 



140 



Phe Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu 

150 155 1«0 

Asp Glu Asn Leu Glu Gly Glu Ala He Asp Phe Thr Gly Lys Leu He 

165 17 0 175 

lie Asp Glu Asn He Tyr Tyr Phe Asp Asp Asn Tyr Arg Glv Ala Val 



190 



Glu Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr 



205 



Gly Lys Ala Phe Lys Gly Leu Asn Gin lie Gly Asp Tyr Lvs Tyr Tvr 

215 220 ' 

Phe Asn Ser Asp Gly Val Met Gin Lys Gly Phe Val Ser He A Sn Asp 

230 2 35 2 40 

Asn Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr 
45 250 255 

Glu lie Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gin 

265 270 

lie Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His Hi, 
275 280 265 

Asn Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu He Scr Tyr Ser Glv 
zyu 295 3 00 * 

He Leu Asn^Phe Asn Asn Lys He. Tyr Tyr Phe Asp Asp Ser Phe Thr 
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305 310 



315 320 



Ala Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe 
325 330 ( 335 

Asp Glu Asp Thr Ala Glu Ala Tyr lie Gly Leu Ser Leu lie Ash Asd 
340 345 350 

Gly Gin Tyr Tyr Phe Asn Asp Asp Gly He Met Gin Val Gly Phe Val 
355 360 365 

Thr He Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly He lie Glu 
370 375 380 

Ser Gly Val Gin Asn He Asp Asp Asn Tyr Phe Tyr He Asp Asp Asn 
385 390 395 400 

Gly He Val Gin He Gly Val Phe Asp Thr Ser Asp Gly Tyr Lvs Tyr 
405 410 4 i 5 

Phe Ala Pro Ala Asn Thr Val Asn Asp Asn He Tyr Gly Gin Ala Val 
420 425 430 

Glu Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly 
435 440 445 

Glu Tnr Tyr Thr He Glu Thr Gly Trp He Tyr Asp Met Glu Asn Glu 
450 455 460 

Ser Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Glv 
465 470 475 Y 480 

He Asn Leu He Asp Asp He Lys Tyr Tyr Phe Asp Glu Lys Gly He 
485 490 495 

. Met Arg Thr Gly Leu He Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn 
500 505 510 

Glu Asn Gly Glu Met Gin Phe Gly Tyr He Asn He Glu Asp Lys Met 
515 520 525 

Phe Tyr Phe Gly Glu Asp Gly Val Met Gin He Gly Val Phe Asn Thr 
530 535 540 

Pro Asp Gly Phe Lys Tyr Phe Ala His Gin Asn Thr Leu Asp Glu Asn 
545 550 555 560 

Phe Glu Gly Glu Ser lie Asn Tyr Thr Glv Trp Leu Asp Leu Asp Glu 
565 570 575 

Lys Arg Tyr Tyr Phe Thr Asp Glu Tyr He Ala Ala Thr Gly Ser Val 
580 565 590 

He He Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gin Leu 
595 600 605 

\2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 0 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

( B ) LOCATION: 1 . . 1314 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO * 22 • 

5 ' E SI 5S E E E E S S ^ffi S EE E 

m EE e e e ee ~ « e e E e e e as 

E E 2S. SE S E E E EE EE E SITE E 



15 



45 



E S tl? 2S E E E E E E E E 22 E E E 
» E X SIT E EE E S E Sit EE E E E 
M . .SK E E E E E E E ilS E S E E E E E 
^ E E E E E E E E E E E E ^ E E E 
E E ^ & E E E E E 25 SE E S S 

120 125 



EE E EE a: E E E E SI Sit E E E E 

135 14Q 



- E E E E E E E E E E E E E E SIT E 
4J E E E E EE E E E E E E E E E E 
. K.K.EEEEEEEEEEEEEE 

50 185 190 

E E E E E E S E E E E E E E E E 
55 E E E E E E E E E E E E 5 E E E 
« E E E E E E E E E E E E E E E E 

235 2 

„■ E E E E E E E E E E E S£ E E E E 
E E E E E E E E SIT E E E E Sit E E 

70 265 270 



48 



96 



144 



192 



240 



288 



336 



364 



432 



460 



526 



576 



624 



672 



720 



76 8 



816 
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CGC GGT TAC ATG TAC CTG AAA GGT CCG CGT GGT TCT GTT ATG ACT ACC 864 
Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 285 

AAC ATC TAC CTG AAC TCT TCC CTG TAC CGT GGT ACC AAA TTC ATC ATC 912 
Asn lie Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe He lie 
290 295 300 ' 

AAG AAA TAC GCG TCT GGT AAC AAG GAC AAT ATC GTT CGC AAC AAT GAT 960 
Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn lie Val Arg Asn Asn Asp 
305 310 315 320 

CGT GTA TAC ATC AAT GTT GTA GTT AAG AAC AAA GAA TAC CGT CtG GCT 1008 
Arg Val Tyr He Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
1> 325 330 335 

ACC AAT GCT TCT CAG GCT GGT GTA GAA AAG ATC TTG TCT GCT CTG GAA 1056 
Thr Asn Ala Ser Gin Ala Gly Val Glu Lys He Leu Ser Ala Leu Glu 
-> 0 340 345 350 

ATC CCG GAC GTT GGT AAT CTG TCT CAG GTA GTT GTA ATG AAA TCC AAG 1104 
He Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys 
355 360 365 

2 3 AAC GAC CAG GGT ATC ACT AAC AAA TGC AAA ATG AAT CTG CAG GAC AAC 

Asn Asp Gin Gly He Thr Asn Lys Cys Lys Met Asn Leu Gin Asp Asn 
370 375 380 



30 



40 



50 



60 



65 



70 



TCT CGC ACT CTG . GGT TGC TCT TGG GAG TTC ATC CCG GTT GAT GAC GGT 
Ser Arg Thr Leu Gly Cys. Ser Trp Glu Phe He Pro Val Asp Asp Glv 
420 425 430 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr He Lys Asn He lie 
1 "5 10 15 

Asn Thr Ser He Leu Asn Leu Arg Tyr Glu Ser Asn His L^u He Asp 
20 25 30 

Leu Ser Arg Tyr Ala Ser Lys He Asn He Gly Ser Lys Val Asn Phe 
35 40 45 

Asp Pro He Asp Lys Asn Gin lie Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 

Lys lie Glu Val He Leu Lys Asn Ala He Val Tyr Asn Ser Met Tyr 
65 70 75 80 



1152 



AAT GGT AAC GAT ATC GGT TTC ATC GGT TTC CAC CAG TTC AAC AAT ATC 1200 
Asn Gly Asn Asp lie Gly Phe He Gly Phe His Gin Phe Asn Asn lie 
385 390 395 400 

GCT AAA CTG GTT GCT TCC AAC TGG TAC AAT CGT CAG ATC GAA CGT TCC 124 8 

Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gin He Glu Arg Ser 
405 410 415 



1296 



TGG GGT GAA CGT CCG CTG TAACCCGGGA AAGCTT 1330 

Trp Gly Glu Arg Pro Leu 
435 

(2) INFORMATION FOR SEQ ID NO: 23: 
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>1U A, n Phe Ser Thr S er Ph e Trp Ile A ^ Ile Pro LysTyr ^ 
Ser lie Ser Leu Asn Asn Glu Tyr Th| He lie >sn C ys Met Cl! Asn 

Asn Ser G ly T rp Lys val Ser Leu Asn Tyr cly Glu Iie Trp Thr 

125 

Leu Gin Asp Thr Gin ox« lie Lys oln Arg Val Val phe Lys ^ 

AJ3 140 
Gin Met He Asn lie Ser Asp Tyr ri e A s„ Arg Trp Ue Phe Val Thr 

He Thr Asn Asn Arg Leu Asn Asn Ser Lys lie Tyr lie Asn Gly Arg 

Leu lie Asp oin L ys Pro He Ser Asn Leu Gly Asn lie „is Ala Ser 

Asn Asn He Met Phe Lys Leu Asp Gly Cys Arg Asp Thr nil Arg Tyr 

^uu 205 
He Trp lie Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 

220 

Glu lie Lys Asp Leu Tyr Asp Asn Gin Ser Asn Ser Gly H e Leu Lys 
Asp Phe Trp Gly Asp Tyr Leu Gin Tyr Asp Lys Pro Tyr Tyr Met ^ 



250 -> 55 



Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly H 



>70 



Ly Ile 



Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 

' B0 285 
Asn He Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe He lie 

*- yb 300 
Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn lie Val Arg Asn Asn Asp 

Arg val Tyr He Asn Val Val val Lys Asn Lys Glu Tyr Arg Leu lZ 

Thr Asn Ala Ser Gin Ala Gly Val Glu Lys Iie Leu Ser Ala Leu Qlu 

345 350 

He Pro Asp Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys 

360 365 

Asn Asp Gin Gly He Thr Ann Lys Cys Lys Met Asn Leu Gin Asp Asn 

J75 380 

Asn Gly Asn Asp He Gly Phe Il e Giy Phe His Gln phe ftsn ^ ^ 

395 40Q 

Ala Lys Lfcu val Ala Ser Asn Trp Tyr Asn Arg Gin He Glu Arg Ser 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe He Pro Val Asp Asp Gly 



430 

Trp Gly Glu Arg Pro Leu 
435 

U) INFORMATION FOR SEQ ID NO: 24: 
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<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

ID) TOPOLOGY: linear i 
(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID -NO: 24: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 10 15 r 

He Glu Gly Arg His Met Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1402 base pairs 
<B) TYPE; nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



tii» MOLECULE TYPE: DNA (genomic) 



fix) FEATURE .:. 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 1386 

(xii SEQUENCE DESCRIPTION : , SEQ ID NO:25: 

ATG GGC CAT CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT 
Met Gly His His His His His His His His His His Ser Ser Gly H^s 
\ 5 10 15 

ATC GAA GGT CGT CAT ATG GCT AGC ATG GCT CGT CTG CTG TCT ACC T^C 
He Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

ACT GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC 
Thr Glu Tyr lie Lys Asn lie lie Asn Thr Ser He Leu Asn Leu Arg 
35 40 45 ^ 

TAC GAA TCC AAT CAC CTG ATC CAC CTG TCT CGC TAC GCT TCC AAA ATC 
Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys He 
50 55 60 



48 



96 



144 



192 



SO ^ tT S? T TCT GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC 240 

A f n Ile G1 Y S ^ Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He 

65 70 7 5 80 

CAG CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT 
Gin Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu Lys Asn 
85 90 95 

GCT ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG 
Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp 
60 100 105 110 

ATC CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC 
He Arg He Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn Glu Tvr 
115 120 125 

"5 ACC ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG 

Thr He Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 140 

70 ^ Z AC S? T A T C ATC TGG ACT CTG CAG GAC ACT CAG GAA ATC AAA 4 80 

70 Asn Tyr Gly Glu Ile He Trp Thr Leu Gin Asp Thr Gin Glu lie Lys 



288 



336 



384 



4 32 
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145 ISO 



155 160 



.as .£ si sit s s: ss si ss s tis sss.e ss' is 

170 175 

35 S S M US SI ?SS; tIS S SS SI SS SIS SI SS 

18 5 - - - 



190 



E Si tTS £ ?IS SS Sf S SS tIS SS SS & ss S E 



200 205 



S - SS E SIT Sc. S S SS tIS 3? SIS SS E E SS 



310 

320 



31S 



£ S SS S £ US tIS tIS £ ^ ~ S - - « „. 



330 



335 



365 



390 

400 



395 



Cvs £1? f TG ^ GAC ^ ** r GGT GAT ATC GGT TTC ATC 

Cys Lys Met Asn Leu Gin Asp Asn Asa Gly Asn Asp lie Gly Ph e lie 

405 410 415 



528 



576 



624 



672 



SI K SS 2S tIS SIS SI E SI SS tTS Slj S - ™ - 
SS E,2I SS S SS SS 5? tIS S» tIS 5 SS "5 SI p 
■ SS. 2S £ Stt K «j «j - Z «= ™ « « Z 

SS E SI E SS tIS EE S5:.-2| ™ »S .2? SS SS SS 
SS SS £i SSS SS S? « S SS 2* EjS SS £ SS 
SH SS 51? SS SI St St! 35 « ?5 - ™ ™= « ™ „ .„ 

295 300 



720 



768 



816 



864 



960 



1008 



■2? i= tIS SI 2S SS SI SI 2J Sit SS - - SI St ST »« 

345 35Q 

£S SS % St* SS S SS SI jg "I SI E SS SI SS St »« 

- 3 36 0 - - 



SS £ tIS IIS E SI SIS as tIS SS SS SIT SS SI SIS E »» 
SS St SIT St E e e ss SS SS SS ss SS s SS & 



1200 



1248 
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GGT TTC CAC CAG TTC AAC AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG 12 96 

Gly Phe His Gin Phe Asn Asn lie Ala Lys Leu Val Ala Ser Asn Trp 
420 425 430 

TAC AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG 1344 
Tyr Asn Arg Gin He Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp 
435 440 445 



445 

GAG TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG 
Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
450 455 460 

TAACCCGGGA AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 26: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Gly His His His His His His His His His His Ser Ser Civ His 
1 5 10 15 

lie Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 jo 

Thr Glu Tyr He Lys Asn He He Asn Thr Ser lie Leu Asn Leu Arg 
35 40 45 

Tyr Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys lie 
50 55 eo 

Asn He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He 
65 .70 75 80 

Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val lie Leu Lys Asn 
85 90 95 

Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Scr Thr Ser Phe Trp 
100 105 no 

He Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tvr 
115 120 12 5 

Thr He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 i40 

Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu lie Lys 
145 150 155 

Gin Arg Val Val Phe Lys Tyr Ser Gin Met He Asn He Ser Asp Tyr 
165 170 175 

He Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn 
180 185 190 



1386 



140; 
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Ser Lys lie Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He S er 

2 00 2015 

Asa Leu Gly Asn ll e His Ala Ser Asn Asn lie Met Phe Lys L eu Asp 

215 220 | K 

Oly Cys Arg Asp Thr His Arg Tyr Il e Trp „. Lys Tyr Phe 

Phe Asp Lys Glu Le Asn Glu Lys Glu lie Lys Asp Leu Tyr Asp Asn 

~ 50 255 
Gin Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin 

> 65 2 7a 

Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lvs Tyr 

280 285 
Val Asp val Asn Asn Val Gly lie Arg Gly Tyr Met Tyr Leu Lys Gly 

295 300 
Pro Arg Gly Ser Val Met Thr Thr Asn lie Tyr Leu Asn Ser Ser Leu 



320 

Tyr Arg Gly Thr- Lys Phe lie He Lys Lys Tyr Ala Ser Gly Asn Lys 

Asp Asn lie Val Arg Asn Asn Asp Arg Val Tyr lie Asn Val Val Val 
340 345 350 



Lvs Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala Glv Val 

360 365 
Glu Lys He Leu Ser Ala Leu Glu lie Pro Asp Val Glv Asn Leu Ser 



37 5 380 



Gin val Val Val Met Lys Ser Lys Asn Asp Gin Gly lie Thr Asn Lvs 

390 400 

cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly Phe lie 

405 110 415 

Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser Asn Trp 

425 430 

Tyr Asn Arg Gin lie Glu Arg Ser Ser Arg Thr Leu Gly Cv3 Ser Trp 

440 445 ' 

Glu Phe ile Pro Val Asp Asp Gly Trp Gly Glu Arq Pro Leu 

455 460 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS 

<A> LENGTH: 38 91 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE : 

<A> NAME/ KEY: CDS 

<B) LOCATION: 1..3888 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG CAA TTT GTT AAT AAA CAA TTT AAT TAT AAA GAT CCT GTA AAT GGT 
Met Gin Phe Val Asn Lys Gin Phe Asn.Tyr Lys Asp Pro vll J£ GlJ 

5 10 . 15 

GTT GAT ATT GCT TAT ATA AAA ATT CCA AAT GTA GGA CAA ATG CAii CCA 
Val Asp He Ala Tyr He Lys He Pro Asn Val Gly Gin Met Gin Pro 
20 25 30 

GTA AAA GCT TTT AAA ATT CAT AAT AAA ATA TGG GTT ATT CCA GAA AGA 
Val Lys Ala Phe Lys He His Asn Lys lie Trp Val He Pro Glu £?J 
35 40 45 

GAT ACA TTT ACA AAT CCT GAA GAA GGA GAT TTA AAT CCA CCA CCA GAA 
Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro G^ 
b ° 55 60 

GCA AAA CAA GTT CCA GTT TCA TAT TAT GAT .TCA ACA TAT TTA AGT ACA 
Ala Lys Gin Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr- Leu Ser Thr 
5 70 75 ao 

GAT AAT GAA AAA GAT AAT TAT TTA AAG GGA GTT ACA AAA TTA TTT ran 
Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr ™ 5£ Glu 

85 90 95 

AGA ATT TAT TCA ACT GAT CTT GGA AGA ATG TTG TTA ACA TCA ATA GTA 
Arg He Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser He vll 
100 105 110 

AGG GGA ATA CCA TTT TGG GGT GGA AGT ACA ATA GAT ACA GAA TTA AAA 
Arg Gly He Pro Phe Trp Gly Gly Ser Thr He Asp Thr Glu Leu ^ 
US 120 125 

GTT ATT GAT ACT AAT TGT ATT AAT GTG ATA CAA CCA GAT GGT AGT TAT 
Val He Asp Thr Asn Cys lie Asn Val He Gin Pro Asp Gly Ser Tyr 
130 1^5 140 

AGA TCA GAA GAA CTT AAT CTA GTA ATA ATA GGA CCC TCA GCT GAT ATT 
Arg Ser Glu Glu Leu Asn Leu Val He lie Gly Pro Ser Ala As£ lie 

150 155 i60 

ATA CAG TTT GAA TGT AAA AGC TTT GGA CAT GAA GTT TTG AAT CTT ACG 
He Gin Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
165 170 175 

A™ ^ S? T Z AT 2? C TCT ACT CAA TAC ATT AGA TTT AGC CCA GAT TTT 
Arg Asn Gly Tyr Gly Ser Thr Gin Tyr He Arg Phe Ser Pro Asp Phe 

180 185 190 

Th^ Phi IT G ?° TCA CTT GAA ° TT GAT ACA AAT CCT CTT TTA 

Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
195 200 205 

GGT GCA GGC AAA TTT GCT ACA GAT CCA GCA GTA ACA TTA GCA PAT ma 
Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr ™ A^a Hil G^ 

CTT ATA CAT GCT GGA CAT AGA TTA TAT GGA ATA GCA ATT AAT CCA AAT 
Leu He His Ala Gly Hxs Arg Leu Tyr Gly He Ala He £sn Pro 

230 235 240 

AGG GTT TTT AAA GTA AAT ACT AAT GCC TAT TAT GAA ATG AGT GCr tt £ 
Arg Val Phe Lys Val Asn Thr Asn Ala Tyr T^r Glu £? Ser G^y Uu 
2-15 250 



46 



96 



144 



192 



240 



2 88 



336 



384 



432 



480 



528 



576 



624 



67: 



720 



768 
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GAA GTA AGC TTT GAG GAA CTT AGA ACA TTT GGG GGA CAT GAT GCA Anr 
Glu Val Ser Phe Glu Glu Leu Arg Thr Phe S? His AsJ Ala L^s 

265 270 

TTT ATA GAT AGT TTA CAG GAA AAC GAA TTT CGT CTA TAT TAT TAT MT 
Phe lie Asp Ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr A^n 

280 285 

AAG TTT AAA GAT ATA GCA AGT ACA CTT AAT AAA OCT AAA TCA ATA GTA 
Lys Phe Lys Asp He Ala Ser Thr Leu Asn Lys Ala £ys Ser tie* vll 

295 300 

257 n£ T G f T TCA TTA CAG TAT ATG AAA AAT GTT TTT AAA GAG AAA 
Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Git £yt 

310 315 ' 320 

TAT CTC CTATCT GAA GAT ACA TCT GGA AAA TTT TCG GTA GAT AAA TTA 
Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser vll As£ ^ " u 

330 33S 

AAA TTT GAT AAG TTA TAC AAA ATG TTA ACA GAG ATT TAC ACA GAG GAT 
Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu 111 Tyr Thr Glu Asp 

AAT TTT GTT AAG TTT TTT AAA GTA CTT AAC AGA AAA ACA TAT TTG AAT 
Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr ™r llu £n 
J3:> 360 365 

TTT GAT AAA GCC GTA TTT AAG ATA AAT ATA GTA CCT AAG GTA AAT TAC 

Phe Asp Lys Ala Val Phe Lys He Asn lie Val Pro Lys Zll A^n Tyr 

. . •* 375 380 

ACA ATA TAT GAT GGA TTT AAT TTA AGA AAT ACA AAT TTA GCA GCA AAC 

rhr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu ftla A^a A^n 

j9v) ^qc 

jyb 400 

TTT "AAT GGT CAA AAT ACA GAA ATT AAT AAT ATG AAT TTT ACT AAA CTA 
Phe Asn Gly Gin Asn Thr Glu He Asn Asn Met Asn Phe Thr Lvs Leu 
405 410 415 

AAA AAT TTT ACT GGA TTG TTT GAA TTT TAT AAG TTG CTA TGT GTA AGA 
Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys vll Aro 
420 425 430 

GGG ATA ATA ACT TCT AAA ACT AAA TCA TTA GAT AAA GGA TAC AAT AAG 
Gly lie lie Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr A^n L^s 

^ -> => 440 445 

GCA TTA AAT GAT TTA TGT ATC AAA GTT AAT AAT TGG GAC TTG TTT TTT 
Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phi 

455 460 

CCT TCA GAA GAT AAT TTT ACT AAT GAT CTA AAT AAA GGA GAA GAA 
Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 
" 470 475 480 

ATT ACA TCT GAT ACT AAT ATA GAA GCA GCA GAA GAA AAT ATT AGT TTA 
lie Thr Ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn lie SeT lIu 
485 490 49s 

Aol 7™ tT A ^ CAA TAT TAT TTA ACC TTT AAT TTT GAT AAT GAA CC- 
Asp Leu He Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 

GAA AAT ATT TCA ATA GAA AAT CTT TCA AGT GAC ATT ATA GGC CAA TTA 
Glu Asn He Ser He Glu Asn Leu Ser Ser Asp lie He Gly Gin lit 

b15 52 0 525 

GAA CTT ATG CCT AAT ATA GAA AGA TTT CCT AAT GGA AAA AAG TAT GAG 
Glu Leu Met Pro Asn lie Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



124 8 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 
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530 535 



540 



TTA GAT AAA TAT ACT ATG TTC CAT TAT CTT CGT GCT CAA GAA TTT GAA 
Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gin, Glu Phe Glu 
. 550 555 I S60 

CAT GOT AAA TCT AGG ATT GCT TTA ACA AAT TCT CTT AAC GAA GCA TTA 
His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 575 

TTA AAT CCT ACT CGT GTT TAT ACA TTT TTT TCT TCA GAC TAT GTA AAG 
Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
580 565 590 

AAA GTT AAT AAA GCT ACG GAG GCA GCT ATG TTT TTA GGC TGG GTA GAA 
Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
595 600 60S 

CAA TTA GTA TAT GAT TTT ACC GAT GAA ACT AGC GAA GTA AGT ACT ACG 
Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
610 615 6 20 

GAT AAA ATT GCG GAT ATA ACT ATA ATT ATT CCA TAT ATA GGA CCT GCT 
Asp Lys He Ala Asp He Thr lie He He Pro Tyr He Gly Pro All 
■ 630 635 640 

TTA AAT ATA GGT AAT ATG TTA TAT AAA GAT GAT TTT GTA GGT GCT TTA 
Leu Asn lie Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
645 650 655 

ATA TTT TCA GGA GCT GTT ATT CTG TTA GAA TTT ATA CCA GAG ATT GCA 
He Phe Ser Gly Ala Val He Leu Leu Glu Phe He Pro Glu lie Ala 
660 665 670 

lie Pro vlt T A T GCA CTT GTA TCA TAT ATr GCG *AT AAG 

lie Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr He Ala Asn Lys 

675 680 gas 

GTT CTA ACC GTT CAA ACA ATA GAT AAT GCT TTA AGT AAA AG A AAT GAA 
Val Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Cyt Arg" j£n G^u 
690 695 700 

AAA TGG GAT GAG GTC TAT AAA TAT ATA GTA ACA AAT TGG TTA GCA AAG 
Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala L^s 

710 7 15 720 

JT7' ^ A u A CAG ATT GAT ■ CTA - ATA AGA AAA AAA ATG AAA GAA GCT TTA 
Val Asn Thr Gin He Asp Leu lie Arg Lys Lys Met Lys Glu Ala Leu 
725 730 735 

GAA AAT CAA GCA GAA GCA ACA AAG GCT ATA ATA AAC TAT CAG TAT AAT 
Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr am 
740 7 45 750 

CAA TAT ACT GAG GAA GAG AAA AAT AAT ATT AAT TTT AAT ATT r*T r-a-r 
Gin Tyr Thr Glu Glu Glu Lys Asn Asn He £n Pnl £n ill AsJ Asp 
755 7 60 765 

TTA AGT TCG AAA CTT AAT GAG TCT ATA AAT AAA GCT ATG ATT AAT ATA 
Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met III jEn He 

/u 77 5 780 

AAT AAA TTT TTG AAT CAA TGC TCT GTT TCA TAT TTA ATG AAT TCT ATG 
Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
85 790 795 800 

ATC CCT TAT GGT GTT AAA CGG TTA GAA GAT TTT GAT GCT AGT CTT AAA 
He Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu £J£ 
805 810 815 



1680 



1728 



1776 



1824 



1872 



192 0 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 
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. S XI.Sl SI US ?» ill £ Sg ffi S SJ S2 IS g; 

825 830 



845 



ill S IE as SI IS £ 52 2? S SI iS SI 2 2 

855 860 



940 



950 



955 



SI SI HI £ SJ SI ill iS ^ S Si St^ 2J K ss SI 

98 5 — — 



990 



55 KS SI IH IS £ill HI s; £ SI III 2? Si HI £ 

1030 1035 1040 

ATT TCA AAT TTA GGT AAT ATT CAT GCT ACT AAT AAT ATA ATG TTT 
He Ser Asn Leu Gly Asn lie „ ia A!a Ser £n £n lie p£ ^ 

60 1045 loso 1055 y 

leu Asp G? y Cyl Arg £1 Thr SI ^ ^ ^ A ™ ^ TAT TTT 

1060 P 9 Tyr Ile Tr P Ile Phe 

1065 1070 

HI SI SI SI H. SI Hi 2J Si HI SI IS HI 2j SI SI 

1O60 1085 

AsJ Ser £n ler oT V 7™ GAC TTT TGG GGT G " ™T 

P n oxn ser Asn Ser Gly n e Leu Lys Asp Phe Trp Gly Asp Tyr 



2496 



2544 



2592 



S SI S Si ?M HI HS HI in iS 21 S SI III s hi 

875 880 

Hi S £ SI s HI SI SI ill 2p Si E - "I S S 

890 895 

AAA ATA AAT ATT GGT AGT AAA GTA AAT TTT pat *™ ^ 

Ly. ,1. »«, jj. »„ s „ , y » ™ Jg £ «J C« JJJ J£.» jjj 

905 910 

Si SI Si SI IE Asn SI Si EES in s: & is SI 

ill ill S iS SJ £ iS X S £ Si HI IS IS £ ffi 



2640 



2688 



2736 



2784 



si tii s in iii hi ??i s si - » E ~ ™ - - „ 2e 

40 965 970 975 



2976 



3024 



in in si is s? sir si hi o ^ K si si? se hi si 

21 £ill HI IS IS SI 511 S IS £HIHI HI SI - 



3120 



3168 



3216 



3264 



3 312 
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15 



3360 



3408 



3456 



3504 



3552 



3600 



3648 



3696 



1090 1095 iioo 

TTA CAA TAT GAT AAA CCA TAC TAT ATG TTA AAT TTA TAT GAT CCA AAT 
Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
1105 1110 1115 H20 

AAA TAT GTC GAT GTA AAT AAT GTA GGT ATT AGA GGT TAT ATG TAT CTT 
Lys Tyr Val Asp Val Asn Asn Val Gly He Arg Gly Tyr Met Tyr Leu 
1125 H30 1135 

AAA GGG CCT AGA GGT AGC GTA ATG ACT ACA AAC ATT TAT TTA AAT TCA 
Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn He Tyr Leu Asn Ser 
1140 1145 ii 50 

AGT TTG TAT AGG GGG ACA AAA TTT ATT ATA AAA AAA TAT GCT TCT GGA 
Ser Leu Tyr Arg Gly Thr Lys Phe He He Lys Lys Tyr Ala Ser Glv 
1155 H60 H65 

AAT AAA GAT AAT ATT GTT AGA AAT AAT GAT CGT GTA TAT ATT AAT GTA 
Asn Lys Asp Asn He Val Arg Asn Asn Asp Arg Val Tyr He Asn Val 
1110 1175 H80 

GTA GTT AAA AAT AAA GAA TAT AGG TTA GCT ACT AAT GCA TCA CAG GCA 
Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gin Ala 
1185 1190 H95 l20 0 

GGC GTA GAA AAA ATA CTA AGT GCA TTA GAA ATA CCT GAT GTA GGA AAT 
Gly Val Glu Lys He Leu Ser Ala Leu Glu lie Pro Asp Val Gly Asn 
1205 1210 1215 

CTA AGT CAA GTA GTA GTA ATG AAG TCA AAA AAT GAT CAA GGA ATA ACA 
Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr 
1220 1225 1230 

AAT AAA TGC AAA ATG AAT TTA CAA GAT AAT AAT GGG AAT GAT ATA GGC 3 74 4 

Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Gly 
1235 1240 1245 

TTT ATA GGA TTT CAT CAG TTT AAT AAT ATA GCT AAA CTA GTA GCA AGT 
Phe He Gly Phe His Gin Phe Asn Asn lie Ala Lys Leu Val Ala Ser 
I 250 1255 1260 

AAT TGG TAT AAT AGA CAA ATA GAA AGA TCT AGT AGG ACT TTG GGT TGC 
Asn Trp Tyr Asn Arg Gin lie Glu Arg Ser Ser Arg Thr Leu Gly Cvs 
1265 1270 1275 1280 

TCA TGG GAA TTT ATT CCT GTA GAT GAT GGA TGG GGA GAA AGG CCA CTG 
Ser Trp Glu Phe He Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
1285 1290 1295 

TAA 

(2) INFORMATION FOR SEQ ID NO; 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Gin Phe Val Asn Lys Gin Phe Asn Tyr Lys Asp Pro Val Asn Gly 
1 5 io 



3792 



3840 



3888 



3891 



Val Asp He Ala Tyr lie Lys He Pro Asn Val Gly Gin Met Gin Pro 
20 25 30 

Val Lys Ala Phe Lys He His Asn Lys He Trp Val lie Pro Glu Arg 
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Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Le U A sn Pro Pro Pro Glu 

^ 60 
Ala L ys Gin Val Pro Val Ser T yr Tyr ASp Ser Thr Tyr ^ ^ ^ 

75 80 
Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu 

Arg Ile T yr ; „ Thr Asp , eu Gly Arig Met Leu Uu Thr Ser Val 

"I Gly SS *«. »e Asp Thr Glu Leu Lys 

X^U 125 

val ij. Asp Thr Asn Cys lie Asn val Ile Gln PrQ Q # ^ 

iJb 140 - 

Arg Ser Glu Glu Leu Asn Leu Val He Ile ? l y Pro Ser Ala ^ 

He Gin Phe Glu cy. L ys Ser P he Cl y His Glu Val Leu Asn Leu Z 

Arg Asn Gly T yr dy Ser Thr Gin Tyr Ile Arg Phe Ser Pro Phe 

Thr PhG aly .Ph. Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 

^ O0 205 
Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 

<;15 220 
Leu lie His Ala .cly His Arg Leu Tyr Gly lie Ala lie Asn Pro Asn 

** 2 3 5 2 

Arg val Phe Lys val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gl y Leu 

Glu val ser Phe Glu Glu Leu Arg Thr Phe Gl y Gl y His Asp 1^ Lyg 

^ bb 2 70 

Phe tie Asp ser Leu Gin Glu Asn Glu Phe Arg Leu Tyr Tyr T yr Asn 

<- UD 285 

Lys Phe Lys Asp lie Ala Ser Thr Leu Asn Lys Ala Lys Ser He Val 

295 300 

Gly Thr Thr Ala Ser Leu Gin Tyr Met Lys Asn Val Phe Lys Glu Lys 

Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 

330 335 

Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu He Tyr Thr Glu Asp 

345 350 
Asn Phe val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 

360 365 
Phe Asp Lys Ala Val Phe Lys lie Asn He Val Pro Lys Val Asn Tyr 

375 380 
Thr lie Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 

395 400 
Phe Asn Gly Gin Asn Thr Glu lie Asn Asn Met Asn Phe Thr Lys Leu 



410 415 



274 



10 
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Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Arg 
420 425 4 3o 

GXy He lie Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lys 

440 445 

Ala Leu Asn Asp Leu Cys He Lys Val Asn Asn Trp Asp Leu Phe Phe 

4 55 460 

Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 

470 475 480 

lie Thr ser Asp Thr Asn He Glu Ala Ala Glu Glu Asn He Ser Leu 



15 485 490 49s 

Asp Leu lie Gin Gin Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 



20 



25 



Glu Asn lie Ser He Glu Asn Leu Ser Ser Asp He He Gly Gin Leu 

520 52 5 

Glu Leu Met Pro Asn He Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 

535 5 4 o 

Leu Asp Lys Tyr thr Met Phe His Tyr Leu Arg Ala Gin Glu Phe Glu 

550 5SS 560 

His Gly Lys Ser Arg He Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 

30 565 570 575 

Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tvr Val Lys 
580 585 590 

Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 

600 60S 

Gin Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
610 615 620 

Asp Lys lie Ala Asp He Thr He lie He Pro Tyr He Gly Pro Ala 

630 635 6 4o 

Leu Asn He Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
45 645 « 5 0 655 

He Phe Ser Gly Ala Vai lie Leu Leu Glu Phe He Pro Glu He Ala 
660 665 67Q 

He Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tvr He Ala Asn Lys 
675 680 685 

val Leu Thr Val Gin Thr lie Asp Asn Ala Leu Ser Lys Arg Asn Glu 

U 695 700 

Lys Trp Asp Glu Val Tyr Lys Tyr He Val Thr Asn Trp Leu Ala Lys 

710 715 720 

Val Asn Thr Gin lie Asp Leu He Arg Lys Lys Met Lys Glu Ala Leu 
60 730 7 35 

Glu Asn Gin Ala Glu Ala Thr Lys Ala He He Asn Tyr Gin Tyr Asn 

74 0 74 5 - 2 



40 



750 



6 . Gin Tyr Thr Glu Glu Glu Lys Asn Asn He Asn Phe Asn He Asp Asp 

760 755 



70 



Leu Ser Ser Lys Leu Asn Glu Ser He Asn Lys Ala Met He Asn He 

77 S 780 

Asn Lys Phe Leu Asn Gin Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
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78 5 



790 



795 



10 



15 



30 



40 



45 



60 



BOO 



He Pro Tyr G ly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 

8X0 815 
Asp Ala Leu Leu Lys Tyr He Tyr Asp Asn Arg Gly T * r Leu He Gly 

925 830 

Gin Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 

840 845 

He Pro Phe Gin Leu Ser Lys Tyr Val Asp Asn Gin Arg Leu 

o55 - — 



860 



Leu Ser 



Thr Phe Thr Glu Tyr lie Lys Asn He He Asn Thr Ser He Leu Asn 

875 880 
Leu Arg Tyr Glu Ser Asn „ is Leu Ile Ser ^ ^ ^ ^ 

JO S90 895 

Lys He Asn He Gly Ser Lys Val Asn Phe Asp p roIlB Asp Lys ^ 

905 910 
Gin He Gin Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu 

920 925 

Lys Asn Ala He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 

935 940 
Phe Trp lie Arg He j„ Lys Tyr Phe Asn Ser lie Ser Leu Asn Asn 

Glu Tyr Thr lie lie Asn Cys Mst Glu Asn Asn Ser Gly Trp Lys Val 
.i5 970 975 

Ser Leu Asn Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu 



98 5 



990 



He Lys Gin Arg Val Val Phe Lys Tyr Ser Gin Met lie Asn He Ser 

1000 1005 
Asp Tyr lie Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu 

1015 1020 
Asn Asn ser Lys lie Tyr lie Asn Gly Arg Leu He Asp Gin Lys Pro 

1030 "35 1040 

He ser Asn Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys 
50 1045 1050 10S5 

Leu Asp Gly cys Arg Asp Thr His Arg Tyr He Trp lie Lys Tyr Phe 

1065 1070 

55 L ° U ^75 ASP ^ A ^ n G1U L *» G1 " >1« ^= Asp Leu Tyr 

1080 1085 

Asp As^Gln ser Asn Ser Gly He Leu Lys Asp phe Trp Qly Agp 

AU " 1100 

Leu Gin Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 

1110 , 1115 1120 

Lys Tyr Val Asp Val Asn Asn Val Gly He Arg cly Tyr Met Tyr Leu 

65 . 1125 "30 iijs 

Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn lie Tyr Leu Asn Ser 

1145 1150 

?() Ser Leu Tyr Arg Gly Thr Lys Phe n e He Lys Lys Tyr Ala Ser Gly 

1160 1165 
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Asn Lys Asp Asn lie Val Arg As n As n Asp Arg Val Tyr He Asn Val 
1170 1175 1180 

Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala. Ser Gin Ala 
1185 1195 I 1200 

Gly Val Glu Lys He Leu Ser Ala Leu Glu He Pro Asp Val Gly Asn 
1205 1210 1215 

Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin Gly He Thr 
1220 1225 1230 

Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn Asp He Glv 

1235 1240 1245 - . 

Phe He Gly Phe His Gin Phe Asn Asn He Ala Lys Leu Val Ala Ser 
1250 1255 1260 



20 l265 TrP ^ ^ ^ ?270 Ile ^ ^ Gly CyS 

Ser Trp Glu Phe lie Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
1285 1290 1295 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 29: 

fi) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(BJ TYPE: nucleic acid 
<C) STRANDEDNESS : single 
!D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = M DNA " 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 
CGCCATGGCT AGATTATTAT CTACATTTAC 
40 (2) INFORMATION FOR SEQ ID NO: 30: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
4: * (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = M DNA " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GCAAGCTTCT TGACAGACTC ATGTAG 
^ (2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACCATG GGCCATCATC 
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AXCAXCAXCA XCAXCATCAT CACAGCAGCG . GCCAXATCGA AGGTCGTCAT . ATGGCTAGCA X1 

tggctagAxx attatctaca tttactgaat atattaagaa . tattattaat ACTTCTATAT 2 < 

5 TGAATTTAAG ATATGAAAGT AAXCAXTTAA TAGACTTATC TAGGTATGCA TCAAAAATAA 3( 

ATATTGGTAG xaaagxaaax tttgatccaa XAGAXAAAAA tcaaattcaa TTATTTAATT ' 

)() TAGAAAGTAG TAAAATTGAG GXAATTXXAA AAAATGCTAT TGTATATAAT AGTATGTATG 42 

AAAATTXXAG XACXAGCXTX XGGAXAAGAA XXCCTAAGXA TTTTAACAGT ATAAGTCTAA 4e 

ATAATCAAXA XACAAXAAXA AATTGTATGG AAAATAATTC AGGATGGAAA GXAXCACTXA 54 

ATTATGGTGA AAXAAXCXGG ACTTXACAGG AXACXCAGGA AATAAAACAA AGAGTAGTTT 60 

TTAAATACAG TCAAATGATT AATATATCAG ATTATATAAA CAGATGGATT TTTGTAACTA 6C( 

2() TCACXAAXAA TAGATTAAAT AACXCXAAAA TTTATATAAA XGGAAGAXTA ATAGATCAAA 7 , ( 

AACCAATTTC AAATTTAGGT AATATTCATG CXAGXAAXAA TATAATGTTT AAATTAGATG ^ 

GTTGTAGAGA T ACACA T AGA XAXA^GCA TAAAAXATXX TAAXCXXXTX GATAAGGAAT 84c 

- XAAAXGAAAA AGAAAXCAAA GATTTATATG AXAAXCAAXC AAATTCAGGT ATTTTAAAAG 90t 

acxt-xxgggg XGAXXAXXXA CAAXAXGAXA AACCAXACXA XAXGXXAAAX XXAXAXGAXC 96fi 

30 CAAAXAAATA XGXCGATGTA AAXAAXGXAG GTAXXAGAGG XXAXAXGXAX CXXAAAGGGC 1020 

CXAGAGGTAG CGXAAXGACX ACAAACAXTX AXXXAAAXXC AAGXXXGXAX AGGGGGACAA 10 S0 

AAXXXAXTAX AAAAAAAXAX GCXXCTGGAA AXAAAGAXAA XAXXCXXAGA AATAAXGATC 1140 
" GTGT ^XAX TAAXGXAGXA GTXAAAAAXA AAGAAXAXAG GXXAGCXACT AAXGCAXCAC 
AGGCAGGCGX AGAAAAAAXA CTAAGXGCAX XAGAAAXACC XGATGXAGGA AATCXAAGXC 
40 AAGTAGTAGT *™*»*A AAAAATGAXC AAGGAAXAAC AAAXAAAXGC AAAATGAAXT 
TACAAGAXAA X AAXGGGAAT GAXAXAGGCX XXAXAGGAXX TCAXCAGXXX AAXAAXAXAG 



50 



TCXCAAACTA CAXGAGXCXG XCAAGAAGCX TGCGGCCGCA CXCGAG 
(2) I NFORMAX I ON FOR SEQ ID NO: 32 



<i) SEQUENCE CHARACXERISXICS - 

(A) L.ENGXH : 9 amino acids 

(B) XYPE: amino acid 

SS <C) SXRANDEDNESS : not relevant 

<D> XOPOLOGY : not relevant 

(ii) MOLECULE XYPE: peptide 
6() <Xl) SE Q«ENCE DESCRIPTION: SEQ ID NO:3 2: 

Met His His His His His His Met Ala 
1 5 

65 INFORMATION FOR SEO ID NO:33: 

(i) SEQUENCE CHARACXERISXICS- 

(A) LENGXH: 21 base pairs 

(B) XYPE: nucleic acid 
70 <C> SXRANDEDNESS : single 

(D) XOPOLOGY: linear 



1200 
1260 
1320 
1380 



CXAAACXAGX AGCAAGXAAX XGGXAXAAXA GACAAAXAGA AAGAXCXAGX AGGACXXXGG 144G 
GTTGCTCATG GGAAXXXAXX CCXGXAGAXG AXGGAXGGGG AGAAAGGCCA CXGXAAXXAA 
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(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TATGCATCAC CATCACCATC A 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA *' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

CATGTGATGG TGATG GTG AT GCA 2 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1335 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

ATG CAT CAC CAT CAC CAT CAC ATG GCT CGT CTG CTG TCT ACC TTC ACT 4 
Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
1 5 10 15 

GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC TAC 9 
Glu Tyr , He Lys Asn He He Asn Thr Ser He Leu Asn Leu Arq Tyr 
20 25 30 

GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC AAC 14 
Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys lie Asn 
35 40 45 

ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC CAG 19 
He Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin lie Gin 
50 55 60 

CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT GCT 24 
Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn Ala 
65 70 75 80 



ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG ATC 
He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp lie 
85 go 95 



28 



CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC ACC 3 3 

Arg He Pro Lys Tyr Phe Asn Ser He Ser Leu Asn Asn Glu Tyr Thr 
100 105 no 

ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG AAC 38 
He He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 
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AAT CGC TGG ATC TTC GTT ACC ATC Arc wr' 4 »T 

Asn Arg Trp He Phe Val Thr lie Thr £££ ^ * GT CTG ^ ^ TCC 

& S5 I£ ;i« 25 « SS 2S : ffi:2J S Si S tI5 £ 21 



185 19b 



25 SI 25 tis ss S 151 21 25 ?I5 S JI C r CTO «= ■» 

195 Vnn Gt Phe L ^ s Leu Asp Gly 

" iUU 205 

Cyl SJ Asp Thr £E ™° TAC TTC AAT CTG TTC 

210 9 Iie lr P Iie Tyr Phe Asn Leu Phe 

220 

^ Si' £S ^ ?K {« ^ «C -C TAC CAC AAC CAC 

=25 ° lu IIe Lvs Asp Leu Tyr Asp Asn Gin 



35 240 



.E 21 ESJ tI5 S 12 2? S 5? - ™ - - 



250 255 



til iZ S i£ £5 K 25 25 25 S? 21 S 25 £2 I~ ST 



265 270 



25 SS iS 21 SJ s; S 2J - - Met EE - - S 
S SI SI SSI 2? S £ 25 ?I5 £5 25 25 E E £2 I~ 

b " 300 



s; ss. s s: ts k ?i5 w s: s? s e si 25 ^ 25 



31 * 320 



!S K5 21 25 K 21 21 S S? ^ K J" SS S SIT ^ 
22 X SJ ^ JS 515 S S 25 S E §2 SI S 52 



350 



& ffi us si s 25 sr s s as si si r 5 tc tct «<= 

355 vro AS P val Gly Asn Leu Ser Gin 



360 365 



SS vll S2 £? ^ E ^ 2S 25 S?S SJ ;i. s 2S « 

375 380 

i S? 21 SS 2j 2S 21 SS 2S 21 K SJ JE - ^ 
E 2J S!-K 25 21 K S 2:.S SIT S E 25 ?£ £ 
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528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 
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405 410 415 

AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG GAG 12 96 

Asn Arg Gin lie Glu Arg Set Ser Arg Thr Leu Gly Cys Ser Trp Glu 
420 425 t 430 

TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG TAACCCGGGA 134 5 

Phe lie Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
435 440 445 



AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 36: 

<i» SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY; linear 

20 (ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr 
-=> 1 5 10 15 

Glu Tyr He Lys Asn He lie Asn Thr Ser He Leu Asn Leu Arg Tvr 
20 25 30 
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Glu Ser Asn His Leu He Asp Leu Ser Arg Tyr Ala Ser Lys lie Asn 
3 5 4 0 45 

lie Gly Ser Lys Val Asn Phe Asp Pro He Asp Lys Asn Gin He Gin 
50 55 60 

Leu Phe Asn Leu Glu Ser Ser Lys He Glu Val He Leu Lys Asn Ala 
65 70 75 80 

He Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp He 
85 90 95 

Arg He Pro Lys Tyr Phe Asn Ser lie Ser Leu Asn Asn Glu Tyr Thr 
100 105 no 

lie He Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn 
115 120 125 

Tyr Gly Glu He He Trp Thr Leu Gin Asp Thr Gin Glu lie Lys Gin 
13u 135 140 

Arg Val Val Phe Lys Tyr Ser Gin Met He Asn lie Ser Asp Tyr He 
145 ISO 1S5 160 

Asn Arg Trp He Phe Val Thr He Thr Asn Asn Arg Leu Asn Asn Ser 
165 170 175 

Lys He Tyr He Asn Gly Arg Leu He Asp Gin Lys Pro He Ser Asn 
180 185 190 

Leu Gly Asn He His Ala Ser Asn Asn He Met Phe Lys Leu Asp Gly 
195 200 205 

Cys Arg Asp Thr His Arg Tyr He Trp He Lys Tyr Phe Asn Leu Phe 
210 215 220 

Asp Lys Glu Leu Asn Glu Lys Glu He Lys Asp Leu Tyr Asp Asn Gin 
225 230 235 240 

Ser Asn Ser Gly He Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gin Tyr 
245 250 255 
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;as P Lys Pro ^ r Tyr Met Leu Asn ^ r Asp Pro ^ Lya Tyr vai 

' .. . bb 270 

Asp Val Asn Asn Val Gly n e Ara Glv m 

275 Y 280 7 Tyr Leu Lys G1 y Pro 

- g Cly S .r VaX Met Thr * hr Asn Ile Tyr ^ Asn Ser Ser Leu Tyr 
Oly THr Lys Phe „. Ile Lys Lys Tyr ^ ^ ° l ^ ^ 

A S „ Ile val Arg A| „ Asn Asp Arg val ^ ne v ^ v ^ ^ . ™ 

Asn , ys alu 2, Arg ^ Ala Thr wa a ^ ^ Qlu 

345 350 
-VS lie l.„ ser A. a Leu G lu lie Pro Asp Val Gly A| „ Leu ^ ^ 

Val Va i Val Met Lys ser ^ ^ ftsn Lys ^ 

-I Met Asn Leu Gln A f£ Asn *»« <u y Asp "I Gly phe Ile Glv 

Ph, „i, Uin Phe ^ Asn ftla ^ ^ ^ 

A=n Arg Gln „ ; Glu Arg ser ^ ^ 3^ ^ 

Phe lie Pro Va! Asp Asp Giy Qly Qlu ^ ^ "° " 

445 

12) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 27 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : sinqle 

(D) TOPOLOGY: linear 

ill) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 : 

CGCATATGAA TATTCGTCCA TTGCATG 

(2) INFORMATION FOR SEQ ID NO: 38: 

<il SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 27 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui> MOLECULE TYPE: other nucleic acid 
MA) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 38: 

GGAAGCTTGC AGGGCAATTA CATCATG 

(2) INFORMATION FOR SEQ ID NO: 39 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 3876 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..3873 



(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 3 9 : 

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 

Met Pro Val Thr lie Asn Asn Phe Asn Tyr Asn Asp Pro lie Asp Asn 
11 5 10 15 



48 



96 



GAC AAT ATT ATT ATG ATG GAA CCT CCA TTT GCA AGG GGT ACG GGG AGA 
Asp Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Ara 
20 25 30 

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCC GAA 144 
Tyr Tyr Lys Ala Phe Lys lie Thr Asp Arg lie Trp He He Pro Glu 
3 5 4 0 4 5 

2> AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC GGT 19"> 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

ATT TTT AAT AGA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAC TTA AAT 24 0 

He Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 
65 70 75 80 

ACC AAT GAT AAA AAG AAT ATA TTT TTC CAA ACA TTG ATC AAG TTA TTT 288 
Thr Asn Asp Lys Lys Asn He Phe Phe Gin Thr Leu He Lys Leu Phe 
85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 3 36 

Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 
100 105 no 

ATA AAT .GGT ATA CCT TAT CTT GGA GAT AGA CGT GTT CCA CTC GAA GAG 3 84 

He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 
US 120 125 

45 TTT AAC ACA AAC ATT GCT AGT GTA ACT GTT AAT AAA TTA ATT AGT AAT 4 32 

Phe Asn Thr Asn lie Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 
130 135 140 

CCA GGA GAA GTG GAG CGA AAA AAA GGT ATT TTC GCA AAT TTA ATA ATA 4 80 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 
145 150 155 160 

TTT GGA CCT GGG CCA GTT TTA AAT GAA AAT GAG ACT ATA GAT ATA GGT 528 
Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp He Gly 
33 165 170 175 

ATA CAA AAT CAT TTT GCA TCA AGG GAA GGC TTT GGG GGT ATA ATG CAA 5 76 

He Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly He Met Gin 
180 185 190 

. ATG AAA TTT TGT CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT CAA GAA 6 24 

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 2 05 

AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 6 72 

Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

GCC TTG ATA TTA ATG CAT. GAA CTT ATA CAT GTT TTG CAT GGA TTA TAT 72 0 

t\) Ala Leu lie Leu Met His Glu Leu lie His Val Leu His Gly Leu Tyr 
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225 230 235 ' 
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£i si ^ sr s sk 25 s w iB sir = as 

300 



E SI 52 SIT 55 s IS SI SS 21 ?H 25 s 21 JS ?*i 

315 320 

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTr ptt r-n* ™^ 
«*- Asn Lys Phe ^ Asp Lys Ty - ^ «J CTT GAA GAT TCT GAA GGA 



330 335 



4 00 



TCT GAT AAA AAT ATG GGA AAA GAA TAT Arr firr r-*r- » „ 

Ser Asp Lys Asn ^ Gly Ly . G - «? j™ COT CAC AAT AAA OCT ATA 



410 41S 



4 70 

480 



475 



^ k sg is is - « : - -» ™ - ™ „, ^ „ 
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2S SJ SI S K S S S SI 25 £ & iR Si 
?5I i£ SI SI ™ SI S SS E 12 S g S» Si ffi Si 

S5 SI 21 SI 25 |K 55 i?I S S S5 Si SI SI 25 K. - 
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SS iK Si £Z 55 SI E SI SI SI S » ts | - - 
SI 25 SI 25 25 115 HI til 25 - SI 21 - - S 
SI SI J2 5? SI 25 SI .25 Sli Si III 25 ?S S5 25 25 »« 
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1392 
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GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 1584 
Asp Phe Asn Vai Asp Val Pro Val Tyr Glu Lys Gin Pro Ala lie Lys 
515 520 525 

AAA GTT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 1632 
Lys Val Phe Thr Asp Glu Asn Thr lie Phe Gin Tyr Leu Tyr Ser Gin 
530 535 540 

ACA TTT CCT CTA AAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 1680 
Thr Phe Pro Leu Asn lie Arg Asp lie Ser Leu Thr Ser Ser Phe Asp 
545 550 555 560 

GAT GCA TTA TTA GTT TCT AGC AAA GTT TAT TCA TTT TTT TCT ATG GAT 1728 
Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

TAT ATT AAA ACT GCT AAT AAA GTA GTA GAA GCA GGA TTA TTT GCA GGT 17 76 

Tyr lie Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 
580 585 590 

TGG GTG AAA CAG ATA GTA GAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 1824 
Trp Val Lys Gin lie Val Asp Asp Phe Val lie Glu Ala Asn Lys Ser 
595 600 605 



23 AGT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 1872 

Ser Thr Met Asp Lys lie Ala Asp He Ser Leu He Val Pro Tyr He 
610 615 620 



GGA TTA GCT TTA AAT GTA GGA GAT GAA ACA GCT AAA GGA AAT TTT GAA 1920 
Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 
625 630 635 640 



AGT GCT TTT GAG ATT GCA GGA TCC AGT ATT TTA CTA GAA TTT ATA CCA 1968 
Ser Ala Phe Glu lie Ala Gly Ser Ser lie Leu Leu Glu Phe lie Pro 
-° 645 650 655 



GAA CTT TTA ATA CCT GTA GTT GGA GTC TTT TTA TTA GAA TCA TAT ATT 2016 
Glu Leu Leu He Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr He 
660 665 670 

GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 2064 
Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
675 680 685 

AGA GTG GAA AAA TGG ATT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 2112 
Arg Val Glu Lys Trp He Asp Met Tyr Gly Leu He Val Ala Gin Trp 
690 695 700 

CTC TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT . 216 0 
Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tyr 
705 710 715 720 



AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 22 08 

Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu He He Lys Tyr 
^ 725 730 735 



AAA TAT AAT ATA TAT TCT GAA GAG GAA AAG TCA AAT ATT AAC ATC AAT 2256 

Lys Tyr Asn He Tyr Ser Glu Glu Glu Lys Ser Asn He Asn He Asn 
740 745 750 

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAT GGT ATT AAC CAA GCT ATG 2 3 04 

Phe Asn Asp He Asn Ser Lys Leu Asn Asp Gly He Asn Gin Ala Met 
755 760 765 

GAT AAT ATA AAT GAT TTT ATA AAT GAA TGT TCT GTA TCA TAT TTA ATG 2 3 52 

Asp Asn He Asn Asp Phe He Asn Glu Cys Ser Val Ser Tyr Leu Met 

770 775 780 

AAA AAA ATG ATT CCA TTA GCT GTA AAA AAA TTA CTA GAC TTT GAT AAT 24 00 

Lys Lys Met He Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn 
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50 90S * 990 

AAT AGA TGG TTT TTT GTA 
Asn Arg Trp Phe Phe Val 
iooo i005 

ATT TAT ATT AAT GGC 
He Tyr He Asn Glv 
1015 1020 

« nz tn E S£ p s; sr - tn - - - a; }» 

1030 «3S 1040 



S E i£ 25 25 Sf ^ £ 3J 52 SI £ ™ 

1050 1055 

£ ??i ^ e si m ™ s sk i ta r c t* tca - t a " — 

1060 ?«c U ^ Gin ASn Ile L V S 



2446 



24 96 



2 592 



2640 



X S SB S SI JS E E? S £ E2.g s; - 

15 a ffi & s si ;x ^ ^ ss k si s S s» s tir 
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25 & SO S? I- ^ Sj S ^ 25 S 5 IS E= 25 S *- 
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xi s; s ?ii.'is;'jii s ^ si AGG r T gat q « ata c - 

• 930 9 „° LyS Tyr Ar 9 Asn Asp Asp He G ln Asn 

SI til £1 S£ S A I A ^ T ^ *™ AAA AAT AAT TCA 

94 5 Thr Ile Ile Asn Met Lys Asn Asn Ser 

955 960 

45 ^ - - K - - - Asn « » S Trp £ ^ tn 

- - - i - S £ Ser KJ £ SI Si SI j£ S ~ 

G^U AsJ K SK SI tie 21 A GA 1°° TTT GTA ^ ATT ACT 

99 S 6r blU Tyr Ile Asn Ar 9 Trp Phe Phe Val Thr He Thr 

1000 1005 

ts J£ zz 2; £ n 2S J£ X ^ & SK £ ™ «» j» «» 

1010 ioic y Giy Thr Leu Ser 
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"^?--fT- *** ^TT CAA TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG 

TX^ LyS I1S ° ln Ser Tyr Ser Glu Leu Lys Asp Phe Trp 

1075 1Q 80 1085 

GGA AAT CCT TTA ATG TAT MT AAA GAA TAT TAT ATG TTT 1 AAT GCG OCT 

Y to2 0 Pr ° LSU M6t Tyr AS " LyS G1U T ^ Tyr Met Pnl fttn Ala G^v 
luy u 1095 



1100 



£n ^ I CA TAT A T T CTA GT G AAA GAT TCA TCT GTA GGT GAA 

Asn Lys Asn Ser Tyr He Lys Leu Val Lys Asp Ser Ser Val Gly Glu 

5 " 1-110 HIS 1120 

ATA TTA ATA CGT AGC AAA TAT AAT CAG AAT TCC AAT TAT ATA AAT -at 
He Leu He Arg Ser Lys Tyr Asn Gin Asn Ser Asn Tyr lie £sn T^r 
1125 1130 113 s 



TTA CGT AAA AAG TAT AAA GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 

t-In LyS LyG Tyr LyS ASP TyL " Phe C Y S Iie Ser L >'s Trp Tyr Leu 
±Jt - b0 1255 l^fifj 



1270 1275 12 80 

CAG TTT ATT CCT. AAA GAT GAA GGG TGG ACT GAA TAA 
Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

v2) INFORMATION FOR SEQ ID NO: 40: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1291 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION ; SEQ ID NO: 40: 

Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 
70 5 10 15 



3 2 64 



3312 



3360 



3408 



3504 



355; 



a™ ^ 7 TA £ AT GGA GAA TTT ATT ATA A GA AGA GAG TCA AAT 

Axg Asn Leu Tyr He Gly Glu Lys Phe He He Arg Arg Glu Ser 

20 1140 1145 H50 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT 
Ser Gin ser He Asn Asp Asp He Val Arg Lys Glu tsp iyr He His 
* ii5 1160 1165 

25 CTA GAT TTG GTA CTT CAC CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA 

Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr bvs 

U 1175 iiao 

30 p£ u° ^ Stn CU, f** 7 TG T" TTA TCT ATT ATA AGT GAT 3600 

; c ™ Lys Glu Gln Glu Glu Lys Leu Phe Leu Ser lie lie Ser Asd 
1190 H95 12 00 

TCT AAT GAA TTT TAT AAG ACT ATA GAA ATA AAA GAA TAT GAT GAA CAC 364B 
Ser Asn Glu Phe Tyr Lys Thr He Glu He LVs Glu Tvr Asp Glu Gin 
1205 1210 ' X215 

1°* £ AT AGT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT 
Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
4() 1220 1225 1230 

a™ A T A TTG A T T -GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT 3 74 4 

Asp Asp lie Gly Leu He Gly He His Arg Phe Tvr Glu Ser GIv Val 
1235 1240 1245 



3696 



3792 



SO fCt Glu vI A AGG f** » CA TAT ** G TCA ™ T TTG TGT AAT TGG 3 8-10 

Lys Glu /al Lys Arg Lys Pro Tyr Lys ser Asn Leu Gly Cys Asn Trp 

10"7(\ ^ 



3B76 
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Asp Asn He lie Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 

25 30 

Tyr Tyr Lys Ala Phe Lys lie Thr Asp Arg ileTrp He He Pro Glu 

* J5 40 45 

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 



60 



lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

70 7 5 80 

Thr Asn Asp Lys Lys Asn He Phe Phe Gin Thr Leu He Lys Lou Phe 

8b 9 ° 95 

Asn Arg He Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 

105 110 

He Asn Gly He Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 

120 12 S 

Phe Asn Thr Asn He Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 

135 140 

Pro Gly Glu Val Glu Arg Lys Lys Gly He Phe Ala Asn Leu He He 

150 155 160 

Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr He Asp He Gly 
165 17 o 175 

lie Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly lie Met Gin 
180 190 

Met. Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Gly Ala Ser He Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 



220 



Ala Leu lie Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 

230 235 240 

Gly He Lys Val Asp Asp Leu Pro He Val Pro Asn Glu Lys Lys Phe 
245 2S0 255 

Phe Met Gin Ser Thr Asp Thr He Gin Ala Glu Glu Leu Tvr Thr Ph* 
260 265 270 

Gly Gly Gin Asp Pro Ser He He Ser Pro Ser Thr Asp Lys Ser He 
275 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg Leu Asn 
u 295 300 

Lys Val Leu Val Cys He Ser Asp Pro Asn He Asn He Asn He Tyr 

310 315 320 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Glv 
325 330 335 r 

Lys Tyr Ser He Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys ser Leu 
340 345 350 

Met Leu Gly Phe Thr Glu He Asn He Ala Glu Asn Tyr Lys He Lys 
355 360 365 Y 

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lys 
30 375 380 

Asn Leu Leu Asp Asn Glu He Tyr Thr lie Glu Glu Gly Phe Asn He 
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385 390 395 400 

Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gin Asn Lys Ala lie 
405 410 415 

Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tvr 

420 . 425 ' - — 



430 



Lys lie Gin Met Cys Lys Ser Val Lys Val Pro Gly He Cys He Asn 
435 440 * 



445 



Val Asp Asn Glu Asn Leu Phe Phe He Ala Asp Lys Asn Ser £he ser 
450 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gin Asn Asn 
465 470 475 4 80 

Tyr lie Gly Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asn 
485 490 H 



495 



Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lvs 
515 520 525 

Lys Val Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tvr Ser Gin 
530 535 

Thr Phe Pro Leu Asn He Arg Asp He Ser Leu Thr Ser Ser Phe Asp 
J4b 550 555 560 

Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Glv 
580 56S 59 0 ■ - 

Trp Val Lys Gin He Val Asp Asp Phe Val lie Glu Ala Asn Lys Ser 
595 600 60S 

Ser Thr Met Asp Lys He Ala Asp He Ser Leu lie Val Pro Tyr He 
610 615 620 

Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu 

630 635 640 

Ser Ala Phe Glu He Ala Gly Set: Ser He Leu Leu Glu Phe He Pro 
645 650 655 

Glu Leu Leu He Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr He 
660 665 670 

Asp Asn Lys Asn Lys He He Lys Thr He Asp Asn Ala Leu Thr Lys 
675 680 



685 



Arg Val Glu Lys Trp He Asp Met Tyr Gly Leu He Val Ala Gin Trp 
690 695 700 

Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He Lys Glu Gly Met Tvr 
705 710 715 720 

Lys Ala Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu lie lie Lvs Tyr 
725 730 735 

Lys Tyr Asn lie Tyr Ser Glu Glu Glu Lys Ser Asn He Asn He Asn 
740 745 750 

Phe Asn Asp lie Asn Ser Lys Leu Asn Asp Gly He Asn Gin Ala Met 
755 760 765 
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■ Asp As„ lle Asn Asp Phe lie Asn Glu Cys Ser Val Ser Tyr Leu Met 

' /b 780 
Lys Lys Met lie Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn 

795 j 800 

Thr Leu Lys Lys Asn Leu Leu Asn Tyr lie Asp Glu Asn Lys Leu Tyr 

UU slo G1U ASP G1U S « ^ Val Asp Lys HI Leu 

U2b 830 

Lys Thr lie He Pro Phe Asp Leu Ser Thr Tyr Ser Ann He Glu He 

840 645 

Leu lie Lys He Phe Asn Lys Tyr Asn S*>*- m„ n~ i . 

850 ojc 1 Ser Glu Ile Leu Asn Asn He 

860 

lie Leu Asn Leu Arg Tyr Arg Asp Asn Asn Leu lie Asp Leu Ser Gly 

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys 

Asn Gin Phe Lys Leu Thr Ser Ser Ala Asp Ser Lys He Arg va^ Thr 

Gin Asn Gin Asn lie He Phe Asn Ser Met Phe Leu Asp Phe Ser Val 

920 925 
-Ser Phe Trp He Arg He Pro Lys Tyr Arg Asn Asp Asp He Gin Asn 

yJb 940 

Tyr He Hi. Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser 

9 55 960 

GlyTrp Lvs He Ser He Arg Gly Asn Arg He lie Trp Thr Leu lie 

970 975 

Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn He Arg 

985 990 
Glu Asp He ser oi M Tyr He A sn Arg Trp Phe Phe Val Thr He Thr 

1000 1005 
Asn Asn Leu Asp Asn Ala Lys lie Tyr lie Asn Gly Thr ^ ^ ' 

i01j 1020 

Asn Met Asp He Lys Asp^le Gly Glu Val He Val Asn Gly Glu He 

1035 1040 

Thr Phe Lvs Leu Asp Gly Asp Val Asp Arg Thr Gin Phe He Trp Met 

1050 



1055 



LVS Tyr Phe Senile Phe Asn Thr Gin Leu Asn Gin Ser Asn He Lys 

1065 1070 
Glu lie ryr Ly » He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp 

iuao 1085 

Gly Asn Pro Leu Met Tyr Asn Lvs Glu Tyr Tyr Met Phe Asn Ala Gly 

1095 1100 

uSs^ ^ ^ lil 0 LyS LSU Val Asp Ser Ser Val Gly Glu 

1115 1120 
He Leu He Arg Ser Ly 9 Tyr Asn Gin Asn Ser Asn Tyr He Asn Tyr 

1130 1135 
Arg Asn Leu Tyr He Gly Glu Lys Phe lie He Arg Arg Glu Ser Asn 
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■■ . ■ ' .1 

1140 1145 iiso 

Ser Gin Ser lie Asn Asp Asp lie Val Arg Lys Glu Asp Tyr lie His 
1155 H60 116.5 

Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lvs 
1170 1175 iiao 7 

Tyr Phe LysGlu Gin Glu Glu Lys Leu Phe Leu Ser He He Ser Asd 
1185 1190 1195 1200 

Ser Asn Glu Phe Tyr Lys Thr He Glu He Lys Glu Tyr Asp Glu Gin 
1205 1210 i 2l5 

Pro Ser Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 
1220 1225 1230 

Asp Asp He Gly Leu He Gly lie His Arg Phe Tyr Glu Ser Gly Val 
123S 1240 1245 

Leu Arg Lys Lys Tyr Lys Asp Tyr Phe Cys He Ser Lys TrD Tyr Leu 
1250 1255 126O * 

Lys Glu Val Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cvs Asn Trp 
1265 1270 1275 * 1280 

Gin Phe lie Pro Lys Asp Glu Gly Trp Thr Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO : 4 1 : 

. fi) SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 3876 base oairs 
(B) TYPE: nucleic acid" 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

Ux) FEATURE: 

(A) NAME /KEY : CDS 

( B) LOCATION: 1..3873 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 
Met Pro Val Thr He Asn Asn Phe Asn Tyr Asn Asp Pro He Asp Asn 
1 5 10 IS 

AAT AAT ATT ATT ATG ATG GAG CCT CCA TTT GCG AGA GGT ACG GGG AGA 
Asn Asn He He Met Met Glu Pro Pro Phe Ala Arg Gly Thr Glv Aro 
20 2S 30 

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCG GAA 1 
Tyr Tyr Lys Ala Phe Lys He Thr Asp Arg He Tro He He Pro Glu 
3 5 4 0 45 

AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC GGT l 
Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly 
50 55 60 

ATT TTT AAT AGA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAC TTA AAT 2 
lie Phe Asn Arg Asp Val Cys Glu Tvr Tyr Asp Pro Asp Tyr Leu Asn 
65 70 75 eo 

ACT AAT GAT AAA AAG AAT ATA TTT TTA CAA ACA ATG ATC AAG TTA TTT 
Thr Asn Asp Lys Lys Asn He Phe Leu Gin Thr Met lie L ys Leu Phe 
85 90 95 

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 3 
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Asn Arg He Lys Se r Lys Pro Leu Gly Glu Lys Leu Leu Glu Met lie 

105 110 

tli SI ^ JS S I?I - fij SI IS - - S ffi Si SK 
3 E S KS tS SI. Si £ SI Asl £ Si tS XI SI 

s s?. sk s?.s; s si ^ss : jn IS St SI Si }» s 

HI !K. K SS SS SI 12 IE. Si SI S E SSI til 5 

is sz. si si in st si « ^ « ™ « w ^ 2 c„ 

21? IS III IS 55 Si & Si E St S S S SI Si Si 
US Si S S SI 52 III SI is si ~ - £ E «* cc> 
SS IS ill SI SS SI SK SI tit SI SIT SI 511 SI SI Ill 

:ss tn st st. si si Hi ss tii st li tii si - « £ 
in ii? 'aj si s si si si s? is si si s si S si 

265 270 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



st si si °ti s e ts s s s si ts si si e ts 

"I SI Si SI IIS SI SI HI iS Sg is SI SI is' SI s; - 

300 

s? sr s: si is trt si si s si in si in m ?™ - - 

315 320 
AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC PTT raa ^™ 

Asn Ly S Phe Lys Asp Lys Ty ? ™ vll £J ™ £* 

330 335 

tSIIIE ffi SI St SI £S SI SI St III St S Si _ 
21? HI SI HI S Si - i£ sj «; - 5 ™ »j 

S IS SI SI HI IS E SI SI Si S Si Si si tli st 

i75 380 



1104 



1152 
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AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 

Asn Leu Leu Asp Asn Glu He Tyr Thr He Glu Glu Gly Phe Asn He- 
395 390 395 400 

TCT GAT AAA GAT ATG GAA AAA GAA TAT AGA GGT CAG AaV AAA GCT ATA 

Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gin Asn Lys Ala He 
405 410 415 



AAG ATA CAA ATG TGT AAA AGT GTT AAA GCT CCA GGA ATA TGT ATT GAT 
Lys He Gin Met Cys Lys Ser Val Lys Ala Pro Gly He Cys lie Asp 
4 3 5 4 4 0 4 4 5 

GTT GAT AAT GAA GAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 
Val Asp Asn Glu Asp Leu Phe Phe lie Ala Asp Lys Asn Ser Phe Ser 
450 455 460 



20 



40 



620 



GGA TTA GCT TTA AAT GTA GGA AAT GAA ACA GCT AAA GGA AAT TTT GAA 

Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu 

625 630 635 640 

65 AAT GCT TTT GAG ATT GCA GGA GCC AGT ATT CTA CTA GAA TTT ATA CCA 

Asn Ala Phe Glu He Ala Gly Ala Ser He Leu Leu Glu Phe He Pro 
645 650 655 

GAA CTT TTA ATA CCT GTA GTT GGA GCC TTT TTA TTA GAA TCA TAT ATT 

n) Glu Leu Leu He Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr He 



1200 



1248 



AAT AAA CAA GCT TAT GAA GAA ATT AGC AAG GAG CAT TTG GCT GTA TAT 12 96 

■10 Asn Lys Gin Ala Tyr Glu Glu lie Ser Lys Glu His Leu Ala Val Tyr 

420 425 430 



GAT GAT TTA TCT AAA AAC GAA AGA ATA GAA TAT AAT ACA CAG AGT AAT 
Asp Asp Leu Ser Lys Asn Glu Arg lie Glu Tyr Asn Thr Gin Ser Asn 
465 470 475 480 

2S TAT ATA GAA AAT GAC TTC CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 

Tyr He Glu Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Aso 
485 490 495 

TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 
M* Leu lie Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 

5 °0 505 510 

GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 
Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lvs Gin Pro Ala He Lys 
-° 515 520 " 525 

AAA ATT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 
Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
530 535 540 

ACA TTT CTC TTA GAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 
Thr Phe Leu Leu Asp He Arg Asp He Ser Leu Thr Ser Ser Phe Asp 
545 550 555 560 

43 GAT GCA TTA TTA TTT TCT AAC AAA GTT TAT TCA TTT TTT TCT ATG GAT 

Asp Ala Leu Leu Phe Ser Ann Lys Val Tyr Ser Phe Phe Ser Met Asp 
565 570 575 

TAT ATT AAA ACT GCT AAT AAA GTG GTA GAA GCA GGA TTA TTT GCA GGT 
M> Tyr He Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly 

580 585 590 

TGG GTG AAA CAG ATA GTA AAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 
Trp Val Lys Gin He Val Asn Asp Phe Val He Glu Ala Asn Lys Ser 
^ 595 600 60S 

AAT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 18 72 

Asn Thr Met Asp Lys He Ala Asp He Ser Leu He Val Pro Tvr lie 
610 615 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1920 



1968 



2016 
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660 «5 670 



X^stsjs si fisj-ffss." si 

25 SI Si Si 5? SI ■ j£ SI? SIS? E iS S S SK S? 

700 

2S Ki S KI SI S Si- IH * S ile £ SK S? IE SI 

s; s is si si sx s as s z= S si s: is si 5 
is Si "i is si is ss Si S5-5 s si s i£ ™ 

HI i= 2J K iil E J" S iK SS & S »S .5 SI iK 



760 765 



- ii - - « - » ^ s s is - ~ «? 

s - £ ~ ™ - - 5= ^ - - " «H 

£ SS Si Si g 3 12 i£ ?JJ J2 s] SJJ is Si 12 si 

610 815 

- - - i - Si SI Sii Si S Si St STSi i- ss 
Si S ill SI? S SI SI E JSi S SI K iil S 32 52 

25 iS - - - - s= - si s ss ;n a - - 



855 860 



875 



880 



910 



Sii iil fS iil ill i2 HI iil SI 51? IE - «; j™ ~ ~ 



920 925 



s: ?s s: AGA J2ss jji si is r r ggt ata caa — 

930 ^ , ° LyS Tyr ^VS.Asn Asp Gly He Gin Asn 

335 940 



2064 



!X12 



2160 



2208 



22 56 



2304 



2352 



2400 



2448 



2496 



!544 



2592 



2688 



si si a s? sii ss s si si s? ss s?s si si si £ 
si » iii si iii is si is si S si $ in « S si - 



2 784 



2832 
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20 



40 



50 



55 



60 



65 



TAT ATT CAT AAT GAA TAT ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG 
Tyr lie His Asn Glu Tyr Thr He He Asn Cys Met Lys Asn Asn Ser 
945 950 9S5 960 

GGC TGG AAA ATA TCT ATT AGG GGT AAT AGG ATA ATA TGG ACT TTA ATT 
Gly Trp Lys He Ser lie Arg Gly Asn Arg He lie Trp Thr Leu lie 
965 970 • 975 



GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT 
Glu Asp He Ser Glu Tyr lie Asn Arg Trp Phe Phe Val Thr He Thr 
995 1000 1005 



AAT AAT TTG AAT AAC GCT AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA 
Asn Asn Leu Asn Asn Ala Lys He Tyr He Asn Gly Lys Leu Glu Ser 
1010 *01S 10 2o 

AAT ACA GAT ATT AAA GAT ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA 
Asn Thr Asp He Lys Asp He Arg Glu Val He Ala Asn Gly Glu He 
1025 1030 1035 1040 

ATA TTT AAA TTA GAT GGT GAT ATA GAT AGA ACA CAA TTT ATT TGG ATG 
He Phe Lys Leu Asp Gly Asp lie Asp Arg Thr Gin Phe He Trp Met 
1045 1050 10 55 



4.^ ATT TTA ACA CGT AGC AAA TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT 

He Leu Thr Arg Ser Lys Tyr Asn Gin Asn Ser Lvs Tyr He Asn Tyr 
H25 H30 1135 



2880 



2928 



GAT ATA AAT GGA AAA ACC AAA TCG GTA TTT TTT GAA TAT AAC ATA AGA ?976 
10 Asp lie Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn lie Arc, 

980 985 990 * 



3024 



3072 



3120 



3168 



3264 



AAA TAT TTC AGT ATT TTT AAT ACG GAA TTA AGT CAA TCA AAT ATT GAA 3216 
M Lys Tyr Phe Ser He Phe Asn Thr Glu Leu Ser Gin Ser Asn lie Glu 

1060 1065 10 70 

GAA AGA TAT AAA ATT CAA TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG 
Glu Arg Tyr Lys He Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp 
"° 1075 1080 ioBS 

GGA AAT CCT TTA ATG TAC AAT AAA GAA- TAT TAT ATG TTT AAT GCG GGG 3 312 

Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly 
1090 1095 1100 

AAT AAA AAT TCA TAT ATT AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA 
Asn Lys Asn Ser Tyr He Lys Leu Lys Lys Asp Ser Pro Val Gly Glu 
1105 1115 1120 



3360 



3408 



AGA GAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT 3 4 56 

Arg Asp Leu Tyr He Gly Glu Lys Phe He He Arg Arg Lys Ser Asn 
1140 H45 1150 

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT 3 504 

Ser Gin Ser He Asn Asp Asp He Val Arg Lys Glu Asp Tyr He Tvr 
1155 H60 ii 6 5 7 

CTA GAT TTT TTT AAT TTA AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA 3550 
Leu Asp Phe Phe Asn Leu Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys 
1170 1175 1180 

TAT TTT AAG AAA GAG GAA GAA AAA TTG TTT. TTA GCT CCT ATA AGT GAT : 3600 
Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asp 
1185 1190 1195 12 £ c 

TCT GAT GAG TTT TAC AAT ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG 3648 
Ser Asp Glu Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Glu Gin 
1205 1210 12 15 
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3744 



3792 



s ?s ™ ~s a; s si se^ss » SS£ 

STf SS # £S SgSI 21 SI S S £™ £ E 

1255 1260 
AAA GAG GTA AAA AGG AAA CCA TAT aat ttr a n a 

jg^u ,«a ,v= g. ,g ?» «£ 22 « IS I?? 351 ^ SS 3640 

1275 ' 1280 

CAG TTT ATT CCT AAA GAT GAA GOG TGG ACT GAA TAA 

Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 3876 
1285 1290 

(2) INFORMATION "FOR SEQ .ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids 
(BJ TYPE: amino acid 
<Dj TOPOLOGY: linear 

i.iil' MOLECULE TYPE V protein 

(XI) SEQUENCE DESCRIPTION : SEQ ID NO:42: 

Met Pro Val Thr lie Asn Ann Phe Asn Tyr Asn Asp Pro He A S p Asn 

b 10 15 

Asn Asn He lie Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg 

25 30 

Tyr Tyr Lys Ala Phe Ly S He Thr Asp. Arg He Trp He n e Pro Glu 

40 45 

Arg Tyr Thr Phe Gly. Tyr Lys Pro Glu Asp phe Asn Ly£; Ser ^ 

55 60 
lie Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn 

75 80 

Thr Asn Asp Ly S Lys Asn He Phe Leu Gin Thr Met He Lys Leu Phe 

90 95 

Asn Arg lie Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met He 

He Asn Gly H e Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu 

120 125 
Phe Asn Thr Asn lie Ala Ser Val Thr Val Asn Lys Leu He Ser Asn 

13b 14Q 

Pro Gly Glu Val Glu Arg Ly. Lys Gly He Phe Ala Asn Leu He Ile 



160 
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Phe Gly Pro GXy Pro Val Leu Asn Gl.u Asn Glu Thr lie Asp lie Gly 
165 170 . 175 

lie Gin Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly lie Met Gin 
180 185 i 190 

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Ash Val Gin Glu 
195 200 205 

Asn Lys Gly Ala Ser lie Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro 
210 215 220 

Ala Leu lie Leu Met His Glu Leu lie His Val Leu His Gly Leu Tvr 
225 230 23S , 2 io 

Gly lie Lys Val Asp Asp Leu Pro lie Val Pro Asn Glu Lys Lys Phe 
245 250 255 

Phe Met Gin Ser Thr Asp Ala He Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly Gin Asp Pro Ser lie He Thr Pro Ser Thr Asp Lys Ser He 
275 280 285 

Tyr Asp Lys Val Leu Gin Asn Phe Arg Gly He Val Asp Arg L^u Asn 
290 295 300 

Lys. Val Leu val Cys He Ser Asp Pro Asn lie Asn lie Asn lie Tyr 
305 310 315 320 

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly 
325 330 335 

Lys Tyr Ser He Asp Val Glu Ser Phe Asp Lys Leu Tvr Lys Ser Leu 
340 345 " 350 

Met Phe Gly Phe Thr Glu Thr Asn He Ala Glu Asn Tyr Lys He Lys 
355 360 365 

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys He Lvs 
370 375 380 

Asn Leu Leu Asp Asn Glu He Tvr Thr He Glu Glu Gly Phe Asn He 
385 390 395 400 

Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Glv Gin Asn Lys Ala lie 
405 410 * 41S 

Asn Lys Gin Ala Tyr Glu Glu He Ser Lys Glu His Leu Ala Val Tyr 
420 425 430 

Lys He Gin Met Cys Lys Ser Val Lys Ala Pro Glv He Cys He Asp 
435 440 " 445 

Val Asp Asn Glu Asp Leu Phe Phe He Ala Asp Lys Asn Ser Phe Ser 
450 455 460 

Asp Asp Leu Ser Lys Asn Glu Arg He Glu Tyr Asn Thr Gin Ser Asn 
465 470 475 480 

Tyr He Glu Asn Asp Phe Pro He Asn Glu Leu He Leu Asp Thr Asp 
485 ' 490 495 

Leu He Ser Lys He Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr 
500 505 510 

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gin Pro Ala He Lvs 
515 520 525 

Lys He Phe Thr Asp Glu Asn Thr He Phe Gin Tyr Leu Tyr Ser Gin 
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530 535 

SJb 540 



« """ "* "' u ** p a: *■* -> ••- <>-< ^ «. «~ ph. *. P 

560 

A,P A*a Leu Leu Phe Ser Asn Lys Val T yr Ser phe ^ ^ ^ ^ 

' 575 

Tyr He Lys Thr Ala Asn Lvs Val Val Glu Al, n , 

580 ' lit la Giy Leu Phe Ala Gly 

585 590 
Trp val Lg.Gln ir. val Asn Asp Pne val Ile Glu AU ^ 

6 0 5 

Asn Thr Met Asp Lvs II** ai. n„ ,\ 

610 P " S ^ ASP Ile Ser Leu He val Pro Tyr He 

6 20 

Gly Leu Ala Leu Asn Val Glv Asn ri„ 

6 " 630 7 1U Thr ^ G1 X Asn Phe Glu 

' 640 
Asn Ala Phe Glu lie Ala Glv Ala Ser n« t r 

645 ' er li* Leu Leu G1 " Phe lie Pro 



Glu Lou Leu lie Pro Val Val Gl 



650 655 

y £« a Phe Leu Ucu Glu Ser Tyr 



660 ' ■ Ucu Giu Ser Tyr Il e 

b " 670 

Asp Asn Lys Asn L ys He xi. Lys Thr Ile Asp Asn ^ ^ ^ ^ 

^*|» Glu Lys Trp Ser Asp Me C Tyr Gly Leu Iie val Ala t 

700 



I-eu ser Thr Val Asn Thr Gin Phe Tvr Thr r, , 

705 no Pile Tyr Thr lie Lys Glu Gly Met Tyr 

Ly S Ai a Leu Asn Tyr Gin Ala Gin Ala Leu Glu Glu lie lle Lya 

Ar91Vr ASn V4 X 0 ^ ^ ^ *« A.n Xle Asn 111 Asp 

Phe Asn A=p n e Asn ser Lys Leu Asn Clu cly He Asn Z Ala Ile 
Asp Asn Xle Asn Asn Phe lie Asn Gly Cys Ser Val S " Tyr 
Lys Ly . Me , Ile Pro Leu Aia val Giu ^ ^ J" ^ ^ 

795 800 
Thr Leu Lys Lys Asn Leu Leu Asn Tyr He Asn Glu Ao„ , 

805 p sn L y s L,eu T yj- 

810 815 
-u Xle Gly ser Ala Clu Tyr Glu Lys Ser Lys Val Asn ^ ^ ^ 

Thr Xle Met Pro Phe Asp ,ou Ser Xle Tyr Thr Asn "p Thr Xle 



Leu lie Glu Met Phe Asn Lvs Tyr Asn Ser ri„ n r 

850 y ber Glu Leu Asn As 



860 



;n He 



Il| Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu Xle Asp Leu Ser Gly 
Tyr Gly Ala Lys Val G1 u Val Tyr Asp Gly Val Glu Leu Asn Asp L y l 



895 

Asn Gin Phe Lys Leu Thr Ser Ser Ala A^n <^ r 

900 Asn Ser L V S He Arg Val Thr 



910 
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Gin Asn Gin Asn He lie Phe Asn Set Val Phe Leu Asp Phe Ser Val 
915 920 .925 

Ser Phe Trp lie Arg lie Pro Lys Tyr Lys Asn Asp Gly lie G l n A sn 



940 



Tyr He His Asn Glu Tyr Thr He He Asn Cys Met Lys 1 Asn Asn Ser 

950 9S5 960 

Gly Trp Lys lie Ser He Arg Gly Asn Arg He He Trp Thr Leu He 
965 970 975 

Asp He Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn fie Arg 



990 



Glu Asp lie Ser Glu Tyr He Asn Arg Trp Phe Phe Val Thr He Thr 
995 1000 1005 

AS " 101O LeU ioT s Ile Ile ASn Lys Leu Glu Ser 

Asn Thr Asp He Lys Asp He Arg Glu Val Ile Ala Asn Gly Glu He 

1030 1035 i0 40 

He the Lys Leu Asp Gly Asp He Asp Arg Thr Gin Phe Ile Trp Met 
1045 1050 10 55 

Lyr, Tyr Phe Ser lie Phe Asn Thr Glu Leu Ser Gin Ser Asn He Glu 
1050 1065 i07o 

Glu Arg Tyr Lys lie Gin Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp 
1075 1080 1085 P 

G1V to90 Pr ° i095 LyS Tyr Me ^- Phe A5n Aia G1 V 



1100 



Asn Lys Asn Ser Tyr lie Lys Leu Lys Lys Asp Ser Pro Val Gly Glu 

1110 1115 1120 

He Leu Thr Arg Ser Lys Tyr Asn Gin Asn Ser Lys . Tyr Ile Asn Tvr 
1125 1130 



1135 



Arg Asp Leu Tyr He Gly Glu Lys Phe Ile Ile Arg Arq Lys Ser Asn 
H40 1145 " 1150 

Ser Gin Ser lie Asn Asp Asp lie Val Arg Lys Glu Asp Tyr lie Tvr 
1155 neo 1165 

Leu Asp Phe Phe Asn Leu Asn Gin Glu Trp Arg Val Tvr Thr Tyr Lys 
• 1175 1180 

Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro He Ser Asp 
AB& H90 1195 1200 

Ser Asp Glu Phe Tyr Asn Thr He Gin He Lys Glu Tyr Asp Glu Gin 
1205 1210 1215 

Pro Thr Tyr Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr 

1220 1225 1.230 

Asp Glu lie Gly Leu Ile Gly Ile His Arg phe Giu 
1235 1240 124 5 

Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys tie Ser Lys Trp Tyr Leu 
1250 J.2S5 12 60 

Lys Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp 

1270 127 S 12 eo 

Gin Phe He Pro Lys Asp Glu Gly Trp Thr Glu 
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1285 1290 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc DNA M 

<ix) FEATURE: 

(A) NAME /KEY: CDS 

<B) LOCATION: 108.. 1523 



PCT/US97/15394 



■(xi) SEQUENCE DESCRIPTION : SEQ ID NO:43: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 



Met Gly Hi; 
1 



s - - SI Sj SI gj £ 2? % SI jk ss % • 

1 - - si - a? si 2? s s £ ts sK si 21 > 

21 ss 22 s k 21 "i ;ri je i™ - e :s £ 2 

fiS'SS "I 21 Hi tli 21 Hi E.gj 51 SJ Si 2j SJ Si ' 

SI? K 2J SS Sf £ SI 21 21 & 21 Si K J" Hi s " 

SI K 22 21 £ IS K »; - S ~ ^ « »j f „ jjj , 

III 21 52 S SJ HI 21 HI JS SI « ™ ™ ™ - jjj „ 

X 1 0 ' 

SI 2i ™I KS 21 21 21 JK Si 21 51 ffi 21 21 Si "I » 

- - - S SI S 21 21 IS S5 S! £ S £ ffi " 

s ss 2i s s s sn s us ?n 21 s 21 » £ s •* 



AAA TCA GTA TTT TTT GAA TAT AAC ATA ACA GAA GAT ATA TCA GAG TAT 
Lys .ser Val Phe Phe Glu Tyr Asn He Arg Glu Asp lie ser ctu ™ 

170 175 
ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG GAT AAT GCT 

- 300 - 
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10 



15 



f«n Arg TrP Phe Phe Val Thr Ile Thr Asn Asn Leu Asp Asn Ala 

AOU 18 * 190 19S 

AAA ATT TAT ATT AAT GGC ACG .TTA GAA TCA AAT ATG GAT ATT AAA GAT 
Lys lie Tyr lie Asn Gly Thr Leu Glu Ser Asn Wet Asp lie Lys Asp 

ATA GGA GAA GTT ATT GTT AAT GGT GAA ATA ACA TTT AAA TTA GAT GGT 
He Gly Glu Val He Val Asn Gly Glu lie Thr. Phe Lys Leu Asp G1V 
215 220 225 

GAT GTA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTT AGT ATT TTT 
Asp Val Asp Arg Thr Gin Phe lie Trp Met Lvs Tvr Phe Ser lie Phi 
230 235 ' ' 

AAT ^ TTA AAT CAA TCA AAT ATT AAA GAG ATA TAT AAA ATT CAA 

Asn Thr Gin Leu Asn Gin Ser Asn lie Lvs Glu lie Tyr Lys lie GlS 

" 250 25S 

TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAT 
Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr 
^ 60 26 5 2 70 2 

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 
Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lvs Asn Ser Tyr lie 
280 285 2 9o 

AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA ATA TTA ATA CGT AGC AAA 
Lys Leu Val Lys Asp Ser Ser Val Gly Glu He Leu Ile Arg Ser Lvs 
295 300 3 0 I 

TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT AGA AAT TTA TAT ATT GGA 
Tyr Asn Gin Asn Ser Asn Tyr He Asn Tyr Arg Asn Leu Tyr lie uly 
310 315 320 * 

GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT TCT CAA TCT ATA AAT GAT 11?4 
Glu Lys Phe lie He Arg Arg Glu Ser Asn Ser Gin Ser lie Asn Asp 
J -- 5 330 335 

GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT CTA GAT TTG GTA CTT CAC 117- 
Asp I,e val Arg Lys Glu Asp Tyr lie His Leu Asp Leu Val Leu His 
i40 345 3so 3S5 

4S 5 AT TG ° AGA GTA TAT GCC TAT AAA TAT TTT AAG GAA CAG GAA 1220 

His Guu Glu/rrp Arg Val Tyr Ala Tyr Lys Tyr Pho Lys Glu Gin Glu 
360 365 J70 

GAA AAA TTG TTT TTA TCT ATT ATA AGT GAT TCT AAT GAA TTT TAT AAG 126 8 

Glu Lys Leu Phe Leu Ser Ile lie Ser Asp Ser Asn Glu Phe Tyr Lys 
375 380 385 

t£J rT A GAA TAT GAT GAA CAG CCA TCA TAT AGT TGT CAG 1316 

Thr lie Glu Ile Lys Glu Tyr Asp Glu Gin Pro Ser Tyr Ser Cys Gin 

- - j j\j 395 ~ 



40 



60 



400 



TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAT ATA GGA TTG ATT 
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp lie Gly Leu lie 
405 410 - — 



415 



G?3 tTI «f T ^ T 11° l AC G ^ TCT GGA GTT TTA CGT AAA AAG TAT AAA 
Gly Ile His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys Lys Tyr Lvs 

4aU 425 430 435 



740 



768 



836 



884 



932 



980 



1028 



1076 



1364 



1412 



■6S a I ll T TGT ATA AGT *** TGG TAC TTA AAA GAG GTA AAA AGG AAA 1460 

to Asp Tvr Phe Cys lie Ser Lys Trp Tyr Leu Lys Glu Val Lys Arg Cys 

440 445 450 

CCA TAT AAG TCA AAT TTG GGA TCT AAT TGG CAG TTT ATT CCT AAA GAT 150R 

?() Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Pho lie Pvo Ly^ Asp 

460 46 g 
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23 



40 



50 



70 
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GAA ggg tgg act gaa taa 

Glu Gly Trp Thr Glu 1526 
4 70 

3 (2) INFORMATION FOR SEQ ID NO-44 

• ■ ' ' * . i 

(l) SEQUENCE CHARACTERISTICS- 

(A> LENGTH: 4 72 amino acids 
in (B) TYPE : amino acid 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 
|5 (Xi> SE WENCE DESCRIPTION: SEQ IDNO:44 : 

Gly His His His His His His His His H iS His Ser Ser Gly His 

He Glu Gly Ar* His Met Ala Ser Met Ala Asp Thr He Leu Z cl 

Met Pne Asn Lys Tyr Asn Ser Glu He Leu Asn Asn He »" Leu Asn 

4U 45 

Leu Arg Tyr Arg Asp Asn Asn Leu lie Asd ' <z„ rM , 

SO Asp Leu ^er Gly Tyr Glv Ala 

60 

Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gin Phe 
Lys Leu Thr Ser Ser Ala Asp Ser Lyo lie Arg Val Thr Gin Asn 'all 
Asn Xle lie p he Asn scr Mec phe ^ ^ ^ ^ t ^ 

Xl.-A.r9 lie Pro Lys Tyr Ar g Asn Asp Asp xl e Qln Asn Ty l „, His 

12 5 

A-n. Glu Tyr Thr lie i le Asn Cys Met Lys Asn Asn Ser Gly Trp Lys 

1Jb 140 

lie Ser lie Arg Gly Asn Arg He lie Trp Thr Leu Ile Asp Ile ^ 

155 160 

r.ly Lys. Thr Lys Ser Val Phe Phe Glu Tyr Asn lie Arg Glll Asp Ile 

5fir Ht ASn Arg ^ fjj- V.1 Thr Xle Thr Asn Asn Leu 

185 190 

Asp Asn Ala Lys He Tvr Ile Asn niv tk. t ^ 

195 y il€ G1 Y Thr Leu Glu Ser Asn Met Asp 

2O0 2Q5 

55 He Lys Asp Xle Gly Glu Val He Val Asn Gly Glu Xle Thr Phe Lys 

215 22Q 

Leu Asp Gly Asp Val Arg Thr ^ e t ^ ^ ^ ^ 

60 23 5 24 0 

Ser lie Phe Asn Thr Gin Leu Asn Gin Ser Asn lie Lvs Glu He Tvr 

250 " 255 > 

65 Ue Gln HI ^ Ser G1 " T vr Leu Lys Asp Phe Trp Gly Asn Pro 

- 65 270 
Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn 

280 285 
Ser Tvr lie Lys Leu Val Lys Asp Ser Ser Val Gly Glu lie Leu lie 

- .102 - 
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290 295 300 

Arg Ser Lys Tyr Asn Gin Asn Ser Asri Tyr lie Asn Tyr Arg Asn Leu 
5 305 310 315 1 • 320 

Tyr lie Gly Glu Lys Phe lie lie Arg Arg Glu Ser Asn Ser Gin Ser 
325 330 335 

He Asn Asp Asp lie Val Arg Lys Glu Asp Tyr He His Leu Asp Leu 
,U 340 345 350 



15 



20 



30 



35 



45 



50 



65 



70 



Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tvr Phe Lvs 
355 3 6 o 365 ' 

Glu Gin Glu Glu Lys Leu Phe Leu Ser lie lie Ser Asp Ser Asri Glu 
3 70 3 75 380 

Phe Tyr Lys Thr He Glu He Lys Glu Tyr Asp Glu Gin Pro Ser Tyr 
385 390 395 4oo 

Ser Cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp He 
405 410 4i5 

Gly Leu He Gly He His Arg Phe; Tyr Glu Ser Gly Val Leu Aro Lvs 
420 425 430 " 

Lys Tyr- Lys Asp Tyr Phe Cys He Ser Lys Trp Tvr Leu Lys Glu Val 
435 440 * 445 

Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gin Phe He 
450 455 4 6 o 

Pro Lys Asp Glu Gly Trp Thr Glu 
465 470 

<4) INFORMATION FOR SEQ ID NO:45: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1547 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



tix) FEATURE: . 

(A) NAME / KEY : CDS 

(B) LOCATION: 108.. 1523 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 ; 

AGATCTCGAT CCCGCGAAAT T AAT AC G ACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116 

Met Gly His 
l 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 16 4 

His His His His His His His His His Ser Ser Gly His He Glu Gly 
° {) 5 io 15 



CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 

Arg His Met Ala Ser Met Ala Asp Thr He Leu He Glu Met Phe Asn 

20 25 30 35 

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AG A TAT 

Lys Tyr Asn Ser Glu He Leu Asn Asn He He Leu Asn Leu Arg Tyr 

40 45 50 

AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGG GCA AAG GTA GAG 



212 



260 



308 
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Lys Asp Asn Asn Leu He A^n t^i. c^*- ' 

5s lie Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu 

60 65 

GTA TAT GAT GGA GTC GAG CTT AAT rar n** 

v al Tyr %S p Gly val „; i£ A % T 2£ - T aj/S SS S S 



7S 80 



Sel K J£ Ser j£ «™ ^ CJA- AAT CAG AAT ATC ATA 

8S y I® Arg val Thi Gin Asn Gin Asn He He 

yu 95 



s k e si 2; sj s - - s =» j» « ™ 
is k a si s: s. s si si ™ sj » e sk S is. 
» « s - - - s is tir ~ ~ gaSs s 

^ IS SIf SI Si 3?I SS t2 i£ S* 2" ?T* ™ °"° ™ 

165 AS ° 1Ae Ar 9 Glu Asp He Ser Glu Tyr fi 

lie £1 ^ ?S -I Phi $3 ^ ?« S Asn ^ ^ ^ *~ GCT 

180 1R c iAe Thr Asn Asn Leu Asn Asn Ala 



190 



£ ?Z TOI KI SI SS £ £2 SK K IS S SI - - - 

s js si si si si ss sk ™ s: sn « ™ ^ - 

~ 2 O 2 2 S 

GAT ATA GAT AGA ACA CAA ttt att tv^ „ rt . 

ASP XXe Asp A r g Tnr SK J" ill Trp ss i£ sj s s iH-SS 

jjs sj s A ° r s it; in a* aga ™ ^ att 

24 5 ^ Uln |er Asn He Glu Glu Arg Tyr Lys lie Gin 

TCA TAT AGC GAA TAT TTA AAA GAT TTT Trr rr-* 

Ser Tyr Ser Glu Tyr Leu Lys Asp III g?C £* T " A ATG TAC 

260 26s Y p ^ ne Tr P G1 y A sn Pro Leu Met Tyr 

SI £ Si i 51? HI si xs s; si - - ~ - S 

265 29Q 

£ Z2 K ^ E S ™ ^ SK K ™ £ SI K - 

T T ?i si =3 ts e k s si ^ m ™ 5 K ~ 

315 320 



35€ 



404 



IS - S - - E - - S S S S 2S S £5 K «' 

110 115 



500 



548 



596 



644 



692 



740 



788 



836 



884 



932 



980 



1028 



1076 
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10 



30 



40 



50 



GAA AAA TTT ATT ATA AG A AG A AAG TCA AAT TC.T CAA TCT ATA AAT GAT H24 
Glu Lys Phe lie lie Arg Arg Lys Ser Asn Ser Gin Ser lie Asn Asp 
325 330 335 ^ 



(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 io 15 

5:> lie Glu Gly Arg His Met Ala Ser Met Ala Asp Thr lie Leu lie Glu 

20 25 30 

Met Phe Asn Lys Tyr Asn Ser Glu lie Leu Asn Asn lie He Leu Asn 

60 

Leu Arg Tyr Lys Asp Asn Asn Leu He Asp Leu Ser Gly Tvr Glv Ala 
50 55 60 ~ 

Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lvs Asn Gin Phe 
C ° 65 70 75 80 

Lys Leu Thr Ser Ser Ala Asn Ser Lys He Arg Val Thr Gin Asn Gin 
Q5 go 95 

70 Asn He He Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp 

- 305 - 



1268 



1316 



GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT CT A GAT TTT TTT AAT TTA 117 2 

As P i;e Val Arg Lys-Glu Asp Tyr He Tyr Leu Asp Phe Phe Asn Leu 
340 345 350 ' 355 

AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA TAT TTT AAG AAA GAG GAA i^n 
Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe Lys Lys Glu Glu 
360 365 37 0 

GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT TCT GAT GAG TTT TAC AAT 
Glu Lys Leu Phe Leu Ala Pro lie Ser Asp Ser Asp Glu Phe Tyr Asn 
tD 37 5 380 385 

ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG CCA ACA TAT AGT TGT CAG 
Thr He Gin lie Lys Glu Tyr Asp Glu Gin Pro Thr Tyr Ser Cvs Gin 
2(} 390 395 400 

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAG ATA GGA TTG ATT 1264 
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu He Gly Leu He 
405 410 415 

25 GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT GTA TTT GAA GAG TAT AAA 1412 

Gly He His Arg Phe Tyr Glu Ser Gly He Val Phe Glu Glu Tvr Lvs 
420 425 430 435 

GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 1460 
Asp Tyr Phe Cys He Ser Lys Trp Tyr Leu Lys Glu Val Lys Arq Lvs 
44 ° 445 y 4 50 

CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 15 08 

Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe He Pro Lys Asp 
455 460 465 

GAA GGG TGG ACT GAA TAAAAGCTTG CGGCCGCACT CGAG 
Glu Gly Trp Thr Glu 
4 70 

(2) INFORMATION FOR SEQ ID NO: 46: 



1547 
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He Ar g xle Pro L ys Tyr Lys Asn Asp Gly Ile Gln ^ Tyr ^ ^ 
Asn Clu T y r Thr Xle X!e Asn Cys „ et Lys Asn Asn ^ G j y T ^ 

J5 140 

lie ser ile. Ar 9 Gly Asn Ar g Ile xle Trp Thr Leu He Asp H e f n 
oiy L ys Thr L ys ser val Phe P„ e olu Tyr J^. Arg ^ ^ ^ 

Ser oiu Tyr.ll. Asn Arg Trp Phe Phe Val Thr xle Thr Asn Asn Leu 

185 190 
Asn Asn Ala Lys xle Tyr Xle Asn Gly Lys Leu Glu f „ Asn Thr ^ 

He Lys Asp Xle Ar g Glu Val xi e Ala Asn ci y « u xi^ xle Phe Lys 

^ Asp Gly Asp Xle A f p Arg Thr Cln Phe Xle Trp Met Lys Tyr ^ 

^ ^ Si G1U LeU S - ™» Ser Asn Xle Glu Glu Arg Z 

2 5 5 

Lys Xle Gin Ser Tyr Ser G!u Tyr Leu Lys Asp Phe Trp Gly L Pro 
-eu M et Tyr Asn Lys Qlu Tyr ^ ftia As 7 ° ^ 

~ 80 285 
Ser Tyr Xle Lys Leu Lys Lys Asp Ser Pro Val Gly Glu n- ^ 

300 

Ar f Ser Lys Tyr Asn Gin Asn Ser Lys Tyr Xle Asn Tyr Ar g Asp Leu 
Tyr Xle Gly Glu Lys Phe xle Xle Arg Arg Lys Ser Aan s „ ^ ^ 
He Asn Asp Asp Xle Val Ar g LyS Glu Asp Tyr xle Tyr Leu Asp Phe 
Phe Asn L.u Asn Gin Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Ph e Lys 

L V3 Glu Glu Glu Lys Leu Phe Leu Ala Pro Xle Ser Z Ser Asp olu 

J/b 380 
Phe Tyr Asn Thr Xle Gin Xle Lys Glu Tyr Asp Glu Gin Pro Thr Tyr 

395 400 
Ser cys Gin Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu xle 

Gly Leu Xle Gly Xle His Ar g Phe Tyr Glu Ser Gly He Val Phe Glu 

Glu Tyr Lys Asp Tyr Phe Cys XXe Ser Lys Trp Tyr Leu ly s G Xu VaX 

440 445 



^3 Arg Ly S Pro X yr Asn Leu Lys Leu Gly Cys Asn Trp Gin Phe lie 

460 



Pro Lys Asp Giu Gly Trp Thr Glu 
465 47£) 
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50 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



CGCCATGGCT GATACAATAC TAATAGAAAT G 3l 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
-V (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GCAAGCTTTT ATTCAGTCCA CCCTTCATC ™ 
_>0 ^ 9 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3753 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY; linear 

t/ . tii) MOLECULE TYPE: DNA (genomic) 

(xx) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3750 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 



ATG CCA ACA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT AAT AG A 
Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg 
15 io 



15 



70 



GAT AAA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA AAT GAT 
Asp Lys Phe Leu Lys He Val Thr Lys He Phe Asn Arg He Asn Asp 
85 90 95 



48 



96 



144 



ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA CAA TTT TAT AAA TCA 
Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 
20 25 30 

20 TTT AAT ATT ATG AAA AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 

Phe Asn He Met Lys Asn He Trp He lie Pro Glu Arg Asn Val He 
■*5 40 4 I 

GGT ACA ATT CCC CAA GAT TTT CTT CCG CCT ACT TCA TTG AAA AAT GG A 19' 
00 Gly Thr He Pro Gin Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Glv 

5° 55 60 

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT CAA GAA AAG 
Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gin Ser Asp Gin Glu Lvs 
°- 65 70 75 80 



240 



288 



- 307 - 
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AAT CTT TCA GGA AGG ATT tta tta p»» '^w 

1Ub 110 

a fs si si si s s si s?j s< e sj s«: is - 
s s s sir s s a: e s si sj is S e s si 

1 J!> 140 

Si S SI SIT S !K E SS S SS S SI Si ESK.E 

£ s e js sj. ui a » - - | s ~ * s 

S?I SI SS S K E.S 51? E 5? E E ; « IJ £- 
J25 E K SI SI S - £J SK EMI Si SI £ - - 



200 205 



- xs -« - - s a sti e si - ~ a - .« s 
i s? s s e ss si - ss ss £ s; Eg si 



240 



j£ ™ j£ -J ^ S£ ^ ^ ^ ™ A " «T GGA 

^^y ^y inr Abn lie Glu Glu Phe Leu Thr Phe Gly 



255 



ss £ «p is e tn ?ii s ssi s? e si si s - 



26 5 27b - 



Thr A^T T T f TA GCT GAT TAT ^ AAA ATA GCG TCT AAA CTT ACC m 
inr Asn Leu Leu Ala Aso Tvr i „<- i .. _ • _ , ****** AGC AAA 

275 ASP TV1 ^= ^ He Ala Ser Lys Leu S er Lys 

?™ SK & E "I Si SI.SI SI k 5 SIT E « 



290 



300 



£n Lys Atn A^p K E ™ ™C ACC TTT ACG GAA 

y ne Asn A.p lie Phe Lys Lys Leu Tyr Ser Phe Thr Glu 



325 330 

330 335 



E 21 Si S S Si SI Si SI K E E SK SI I" K 
SS SIS % £ E E £ SI E SI SS 25 SI SI E SI 

360 3€5 

5£ 2£ ?X ser £J ™ A - -T AAT TTA AAG GTA AAT TTT 

y Y A ° n Ile Asn As " Lys Val Asn Phe 



432 



480 



S28 



576 



624 



672 



720 



76B 



816 



864 



GAA 912 



S £ S?? Si S SI E E SS K lil E « - - 

315 320 



1O08 



1056 



1104 



1152 
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370 375 380 

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 
Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He He Thr, Pro He Thr 
385 390 395 ! 400 

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 
Gly Arg Gly Leu Val Lys Lys He He Arg Phe Cys Lys Asn lie Val 
405 410 415 

TCT GTA AAA GGC ATA AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 
Ser Val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

GAG TTA TTT TTT GTG GCT TCC GAG AAT AGT TAT AAT GAT GAT AAT ATA 
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tvr Asn Asp Asp Asn He 
435 440 445 

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 
Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr 
450 455 460 

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 
Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
1,65 470 475 5 480 

CCT GGA CTT TCA GAT GAA AAA TTA AAT TTA ACT ATC CAA AAT GAT GCT 
Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 495 

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 
Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 505 510 

GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA GTG 
Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lvs Val 
51S ~ 520 525 

CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 
530 535 540 

TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 
Leu Leu Glu Gin Pro Lys He Tyr Thr Phe Phe Ser Ser Glu Phe I»e 
545 550 555 560 

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 1728 
Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 
565 570 575 

CAA CAA GTA TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA AGT ACT 
Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 590 

GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 18 24 

Val Asp Lys He Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 
595 600 6 05 

GCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 18 72 

Ala Leu Asn He Gly Asn Glu Ala Gin Lys Gly Asn Phe Lvs Asp Ala 
6l ° 615 620 " 

CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 1920 
Leu Glu Leu Leu Gly Ala Gly He Leu Leu Glu Phe Glu Pro Glu Leu 
625 630 635 640 

TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 196 8 

Leu He Pro Thr He Leu Val Phe Thr He Lys Ser Phe Leu Gly Ser 
64S 650 655 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1776 
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24 00 
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26 40 



2688 



2736 



2784 
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930 935 940 

GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 
Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He lie 
945 950 955 960 

TGG ACA TTG CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC 
Trp Thr Leu Gin Asp Asn Ser Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

TAT GGT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 
Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp ULe Phe 
980 965 990 

GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT 
Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr lie Asn 
995 1000 1005 

GGA AAT TTA ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 
Gly Asn Leu lie Asp Lys Lys Ser lie Leu Asn Leu Gly Asn He His 
1010 1015 1020 

GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 
YSL Ser Asp Asn Ile Leu Phe Lys lie Val Asn Cys Ser Tyr Thr Ara 
1025 1030 1035 1O 40 

TAT ATT GGT ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA 
Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 105O 1055 

ACA GAA ATT CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA 
Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
1060 1065 1070 

AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 
Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tvr Leu 
1075 1080 108 5 

TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT 
Leu Asn Val Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser 
1090 1095 iioo 

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 3 360 

T,o\ LeU Ser Ile Asn Asn Ile Ara Ser Thr Ile Leu Leu Ala Asn Arq 
1105 iHO 1115 1120 

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 3 4 08 

Leu Tyr Ser Gly He Lys Val Lys Ile Gin Arg Val Asn Asn Ser Ser 
H25 H30 H35 

ACT AAC GAT AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT 3 456 
Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr Ile Asn Phe 
H40 115Q 

GTA GCC AGC AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC 3 504 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 H60 H65 

ACA AAT AAA GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT 
Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Ara Phe 
1170 " H75 H80 

AAT CAA GTA GTA GTT ATG AAT TCA GTA GGA TGT ACA ATG AAT TTT AAA 
Asn Gin Val Val Val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lvs 
1185 1190 H95 !200 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



3552 



3600 
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3648 



3696 



3744 



AAT AAT AAT GGA AAT AAT att rr^n « ' 

- «. *,„ „ y s s- IS s? 3 S S?3 S 2£ Si «, s 

vIJS SI SS™ 1 ™' «» AGA GAT A^T ACA^AAC 

1220 P y Vr [„c; HlS Met Ar * As P Asn Thr Asn 

1230 

AGC AAT GGA TTT TTT TGG AAC TTT ATT TPT r-na M * 

s« ox^-he «. Trp "J £ « .«J -J CJA JJ. C.A 

A ^* U 1245 
GAA AAA TAA 

Glu Lys 3753 
1250 , °V 

(2) INFORMATION FOR SEQ ID NO: 50: 

<i) SEQUENCE CHARACTERISTICS • 

Jo! if NGTH: 1250 a*ino" acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50 : 

Met Pro Thr He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Ar g 

Thr He Leu Tyr He Lys Pro Gly Gly Cys Gin Gin Phe Tyr Lys Ser 

" 30 
*he Asn He Met Lys Asn Ile Trp Ile Ile Pro Glu ^ ^ ^ ^ 

Cly Thr lie- Pro Gin Asp Ph Leu Pro pro ,. hr ^ J* 

bb 60 ■ 

Asp Ser ser Tyr Tyr Asp Pro A S „ Tyr Leu Gin Ser Asp Gln Glu Lyg 

Asp L ys Phe Leu Ly| rle val ^ ^ ftsn ^ Asn Jp 

Asn Lou Ser Gly Ar g He Leu Leu Glu Olu Leu Ser Lys Ala As^ Pro 

105 110 
Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe lie He Asn Asp 

Ala ser Ala Val Pro lie Gin Phe Ser Asn Gly Ser 0 J Ser Lfiu 

Leu Pro Asn Val lie n e „ et Giy Ala Glu Pro Asp Leu Phe Glu Thr 

Asn ser Ser Asn lie Ser Leu Ar g Asn Asn Tyr Met Pro Ser Asn Z 

«y Phe Gly ser tie Ala Xle Val Thr Phe Ser Pro Clu Tyr S^r Phe 

^ ^ \ V sl ASP Met G ^ ^ «e Gin Asp Ala Leu 

Thr Leu Met His Glu Leu Xle His Ser Leu His Gly Leu Gly Ala 

215 220 
^3 Gly i le Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
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: 225 230 



235 240 



lie Thr Asn lie Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 250 ,255 

! " 

Gly Thr Asp Leu Asn He lie Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

Thr Asn Leu Leu Ala Asp Tyr Lys Lys He Ala Ser Lys Leu Ser Lvs 
27S 280 285 

Val Gin Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Hhe Glu 
290 295 300 

Ala Lys tyr Gly Leu Asp Lys Asp Ala Ser Gly lie Tyr Ser Val Asn 
30S 310 315 320 

He Asn Lys Phe Asn Asp He Phe Lys Lys Leu Tyr Ser Phe Thr Glu 
32S 330 335 

Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser lie 

-"* 5 . 5 360 365 

Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
370 375 380 

Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He lie Thr Pro He Thr 
385 390 395 400 

Gly Arg Gly Leu Val Lys Lys lie He Arg Phe Cys Lys Asn He Val 
405 410 415 

Ser Val Lys Gly lie Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 — - 



430 



Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
^35 440 445 

Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tvr 
4 50 455 460 

Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
465 470 475 480 

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 495 

Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 505 510 

Asp val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gin Lys Val 
515 520 525 

Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser He Asp Thr Ala 

535 540 

Leu Leu Glu Gin Pro Lys He Tyr Thr Phe Phe Ser Ser Glu Phe He 
545 550 555 560 

Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp He 
565 570 575 

Gin Gin Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gin Lys Ser Thr 
580 585 590 

Val Asp Lys He Ala Asp He Ser He Val Val Pro Tyr He Gly Leu 
595 600 6 05 
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Ala LeuAsn , U Gly ^ GJu Ala Gln ^ ^ ^ ^ ^ ^ ^ 
C lu W Leu Gly ^ Gly rle ^ ^ Glu Phi clu Pro G l u Leu 

Leu He Pro Thr Leu >al Phe Thr „ L ys Ser Phe Leu Gly ^ 

650 655 
Ser Asp Asn Ly§ Asn Lys Val ^ ^ ^ w ^ ^ ^ 

•«lu Ar* Asp CXu L ys Trp Lys Glu val Tyr Ser Phe ^ Zl ser 
Trp H.t Thr- Ly. Ile Asn ^ Gin phe ^ ^ g ^ ^ 

Tyr Gin Ala L eu Gln W Gln Val Asn Ala Leu Ly I Ala He Ile GXu 

S6r LyS T ^ T ^ ^ ^ «« ^ Thr Hi 

LyS Tyr ASP G1 " "« - ^ Olu L eu Asn Gln L ys VaX ser 

Xle Ala Asn Asn Iie Asp Arg Phe Leu Thr Glu Ser Ile Ser 

bU 765 
Tyr Leu Met Lys Leu lie Asn n„ i 

770 - 775 G1U Val Lys Iie Asn Lys Leu Arg Glu 

* 780 

Tyr Asp Glu Asn Val Lys Thr Tvr r T 

785 inr ^ Leu Leu Asp Tyr Xle lie Lys His 

795 800 

Oly ser Ue U,C1, Glu Ser OIn Gln cl Leu Asn ^ ^ ^ ^ 

810 815 
Asp Thr Leu As,, Asn Ser Xle Pro Phe Lys Leu Ser Ser ^ Thr Asp 

A SP Ly . „ s Leu Ile Sfir Tyi h ^ ^ ^ l ^ ~° ^ e ^ 

040 845 

III Sei " LeU Asn ^ Arg xyr L ys Asn Asp Lys Tvr Va! Asp 

OT> * 860 * * 

lei Gi/ ASP S e % AS " ASn " e AS " G1 >' A=P Val -xyr Lys 

875 880 
Tyr Pro Thr Asn Lys Asn Gln Phe Gly U e T vr A^-n t 

885 7 oi^ Y Sp Leu Ser 

890 895 

Glu Val Asn lie Ser Gln Asn Asp Tyr lie' He Tvr * r 

900 e lie T V r As P Asn Lys Tyr 

910 

Lys Asn Phe Ser He Ser Phe Tro V*l am n „ 

915 6 gfg Val Ar 5 He Pro Asn Tyr Asp Asn 

Lys lie Val Asn Val Asn Asn Glu Tvr Thr run A 

930 c^t; Lyr rnr Iie r le Asn Cys Met Arq 

940 

Asp Asn Asn Ser Gly Trp Lvs V^l <z*r- t * 

945 y Uy ° Val Ser Leu Asn His Asn Glu He He 

955 960 
Trp "Thr Leu Gln Asp Asn Ser oly He Asn Gln Lys Leu Ala Phe Asn 

Tyr Gly Asn Ala Asn Gly „. ser Asp Tyr He Asn Lys Trp 11^ Phe 
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980 985 990 

Val Thr lie Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 100 S 

Gly Asn Leu He Asp Lys Lys Ser lie Leu Asn Leu Gly Asn He His 
1010 1015 1020 ' 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Ara 
1025 1030 1035 10 40 

Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

Thr Glu He Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn He Leu 
1060 1065 1070 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 

Leu Asn Val Leu Lys Pro Asn Asn Phe lie Asn Arg Arg Thr Asp Ser 
1090 1095 iioo 

Thr Leu Ser He Asn Asn He Arg Ser Thr lie Leu Leu Ala Asn Ara 
1105 mo ins X1 20 

Leu Tyr Ser Gly lie Lys Val Lys lie Gin Arg Val Asn Asn Ser Ser 
1125 1130 1135 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe 
1140 H45 H50 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 1160 1165 

Thr Asn Lys Glu Lys Thr lie Lys lie Ser Ser Ser Gly Asn Arg Phe 
1170 1175 1180 

Asn Gin Val Val Val Met Asn Ser yal Gly Cys Thr Met Asn Phe Lys 
1185 H90 H95 1200 

Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala Asp Thr 
1205 1210 1215 

Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn 
1220. 1225 1230 

Ser Asn Gly Phe Phe Trp Asn Phe lie Ser Glu Glu His Gly Trp Gin 
1235 1240 1245 

Glu Lys 
1250 

(2) INFORMATION FOR SEQ ID NO: 51: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..3756 

(xi) SEQUENCE DESCRIPTION ; SEQ ID NO:51: 
ATG CCA AAA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT GAT AGA 
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ASP A „ s „ „ e = sj; ss s: si si ss si; 

205 
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96 



144 



192 



24 0 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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vl? rt^ SIM" S CA CTT AAT CCT TAT AAA GAT GTT TTT GAA 

Gin val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 

295 300 

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly lie Tyr, Ser Val Asn 

' rT£ " T ™ T GAT ATT TTT AAA AAA TTA TAC AGC TTT ACG GAA 

I*e Asn Lys Phe Asn Asp lie Phe Lys Lys Leu Tyr Ser Phe Thr Glu 
325 330 335 

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 
Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr ™J iH 
40 345 350 

GGA CAG TAT AAA TAC TTC AAA CTT TCA AAC TTG TTA AAT GAT TCT ATT 
Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu A^n As£ s" i" 

360 3 65 

TAT AAT ATA TCA GAA GGC TAT AAT ATA AAT AAT TTA AAG GTA AAT TTT 
Tyr Asn lie Ser Glu Gly Tyr Asn He Asn Asn Leu Lvs vll Pne 
u ; 375 360 

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 
Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He He Thr Pro ill Thr 
85 390 395 400 

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 
Gly Arg Gly Leu Val Lys Lys lie i le Arg Phe Cys LyW £J iVe Val 
405 410 415 

Ser vll n? C tT* ^ *** TCA ATA T ^T ATC GAA ATA AAT AAT GGT 

Ser Val Lys Gly Ile Arg Lys Ser Ile Cys Il e Glu He Asn Asn Gly 
420 425 4 30 Y 

GAG TTA TTT TTT GTG GCT TCC GAG AAT ACT TAT AAT GAT GAT AAT ATA 
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Aso Asp Asn He 
435 440 445 

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 
Asn Thr Pro Lys Glu lie Asp Asp Thr Val Thr Ser Asn Asn Asn Tyl 

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 
Glu Asn Asp Leu Asp Gin Val Ile Leu Asn Phe Asn Ser Glu Ser Ala 
. 470 4 ?5 480 

f TT I CA 2** *** TTA AAT ACT ATC CAA AAT GAT GCT 

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr Ile Gin Asn Asp Ala 
485 490 495 

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 
Tyr lie Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 505 510 

GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA CTr 
Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp A^a Gin i£ VaT 
51S 520 5 25 

CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser lie AsJ Thr Ma 
" u 535 540 

TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 
Leu Leu Glu Gin Pro Lys lie Tyr Thr Phe Phe Ser ler G^ £e ill 
545 550 555 ™e ^ 

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 
Asn Asn Val Asn Lys Pro Val Gin Ala Ala Leu Phe Val Ser Trp lie 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



14 4 0 



1488 



1536 



1584 



163: 



1680 



1728 
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565 



570 



si si ss si si? s s s sr sis « - - - 

585 5j90 

si s & ?s s ss ts s s sir si. s ?s s & si 

600 605 

E SI HI S S HI SI SI SI « £1 E - «j s 
SI SI SI HI S5 |2 -» £ ™ ™ 5 SK.SS SS K 
SI S S S ill SI SIf .22 S S 5 E HI E - ™° 



6S0 6S5 



E 2? ts £ SI ^ SIT in g s s £1 ss s ™ - 

6 6 5. 



670 



SI « 2J Si J" £ SI Sli s SI i 2 ™ S a; 

680 685 

s «i| s m ?n m jg si K m c a; sts r 
£ .a: s- si si hi as si m s si 5 s. ?x in si 

S £ HI IS SI S™ "I SK - jg S 
HI - SI SI Si ill SI HI SI SI HI SI K S E 



745 750 



ill S II? "I HI i™ 2S IS SE HI S SI K 21 il* - 

760 765 



si si? his s xii si si hi ?n hi hi si s si 

HI SS HI Sf £ S SJ HI S SI £ "I i2 Si SI 

795 800 

S ?S ™ GGA «s §E G f T A ^ TCT ATG GTA — 

r Gln Gln - Glu Leu ft sn Ser Met Val Thr 



815 



825 830 



1776 

1824 

1872 

1920 

1968 

2016 
l 

2064 

2112 

2160 

2208 

2256 

2304 

2352 

2400 

2448 



2? £ SI III HI X ill K JZ £ E s SI £ ill SI »« 



GAT AAA ATT TTA ATT TCA TAT TTT AAT AAA TTC* ttt aa^ ^ * 
Asp Lys lie Leu Xle Ser Tyr Phe £1 ^ ^ j£ ^ ^ 



845 
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20 



50 



60 



65 
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AGT AGT TCA GTT TTA AAT . ATG AG A TAT AAA AAT GAT AAA TAC GTA GAT 

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Aso 

8SO 855 860 

ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GA+ GTA TAT AAA 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys 

86? 870 875 8B0 



885 890 895 

GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 

Glu Val Asn He Ser Gin Asn Asp Tyr He lie Tyr Asp Asn Lys Tyr 

900 905 ■ 



910 



AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 
Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
915 920 925 

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA 
Lys He Val Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Ara 
930 935 9 4o " 



2592 



2640 



TAT CCA ACT AAT AAA AAT. CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 268S 
IU Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 

flfl C onn 



2736 



2784 



2832 



2880 



n-rt wirt uik i i lit AAT CAT AAT GAA ATA ATT 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu lie lie 
945 950 955 

TGG ACA TTG CAA GAT AAT GCA GGA ATT AAT CAA AAA TTA GCA TTT AAC ">928 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe 

965 970 975 

2 976 



GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 

lie 
960 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

TAT GGT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 
Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr lie Asn Lvs Trp He Phe 
*° 980 985 ' 990 

GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT 3024 
Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
4() 995 1000 1005 

GGA AAT TTA ATA GAT CAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 3 0 7"* 

Gly Asn Leu lie Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His 
1010 1015 1020 

43 GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 

Val Ser Asp Asn lie Leu Phe Lys He Val Asn Cvs Ser Tyr Thr Ara 
1025 1030 10 35 * 1040 

TAT ATT GGT ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA 
Tyr lie Gly lie Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 
1045 1050 1055 

ACA GAA ATT CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG 3 216 

Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
DD 1060 1065 io70 



3120 



3168 



AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 3 26 4 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 

TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT GAT AGG AGA AAA GAT TCT 3 312 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lys Asp Ser 
1090 1095 1100 

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 3 36 0 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Ara 
1105 mo ins 

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 34 08 

ni Leu Tyr Ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
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1125 

1130 1135 



3504 



3552 



3600 



3648 



3696 



ACT AAC GAT AAT PTT r-rr ' *^«'\..- 

Thr Asn Wp.ffi E SS ffi-2I.-SS °™ ™ AAT TTT 

1140 y y iTac P Gln Val J le Asn Phe 

1150 

v, s « s s ss a. s » * ~ s - s E « 

s ^ as - s ™ s 3 s 2j s „ 

£35 S 2? ST K !2 2 S3 JZ i» V S ™ - 

e £ is s g; a: - - « ™™ „ « „. 

SS S 23 ^ E K S g? JE ST ffi S iT 2" «» «"« 

1220 P V HlS Met Ar 9 As P His 

1230 

ACA AAC AGC AAT GGA TCT TTT "tyst unr- ^ 

~ v- „ y Cyo JS K t™ E «* «. =« ™ 

* U 1245 ' 

TGG CAA GAA AAA TAA 

Trp Gin Glu Lys „ co 
1250 ^ 3759 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS - 

IbI typf TH: 1252 ™*no '.acids 
' TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52- 
M.J Pro L ys He Asn Ser P„ e Asn Tyr ^ Asp prQ ^ ^ ^ ^ 

Thr l le Leu Ty r I le L ys Pro o ly G1 Cys Gin Glu phe ^ £ ^ 

■ 30 
Phe Asn lie Met L ys Asn Ile t?q Ile Ile pro ^ ftsn v ^ 

oiy t S Thr Pro Gln Asp ^ His pro pro ^ Le 4 u 5 ^ g A ^ 

A-P Ser ser T y r Tyr *. p Pro Asn Tyr Leu ^ ^ ^ ^ 

«p ». l™ ,ys „. VI1 Thr ,. ys „. the ssn Arg ^ ^ 
— — «j « y ,i. L . u Lau „„ al „ Leu ser Lyo ^ J n 5 

^ ... V y „.„ ASP A „ Thr f „ ssp Asn oln ph< ^ ^ 

"" S " "* v " Glu L » »» s « «» «y s.r Z A „ p „. Lcu 
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130 135 140 

Leu Pro Asn Val lie He Met Gly Ala Glu Pro Asp Leu Phe Glu Thr 
145 150 155 160 

Asn Ser Ser Asn lie Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His 
165 170 ' 175 

Gly Phe Gly Ser He Ala He Val Thr Phe Ser Pro Glu Tyr Ser Phe 
180 185 190 

Arg Phe Asn Asp Asn Ser Met Asn Glu Phe He Gin Asp Pro Ala Leu 
195 200 205 

Thr Leu Met His Glu Leu He His Ser Leu His Gly Leu Tyr Gly Ala 
210 215 220 

Lys Gly He Thr Thr Lys Tyr Thr He Thr Gin Lys Gin Asn Pro Leu 
225 230 235 240 

He Thr Asn He Arg Gly Thr Asn He Glu Glu Phe Leu Thr Phe Gly 
245 ^ 250 255 

Gly Thr Asp Leu Asn He He Thr Ser Ala Gin Ser Asn Asp He Tyr 
260 265 270 

Thr Asn Leu Leu Ala Asp* Tyr Lys Lys He Ala Ser Lvs Leu Ser Lys 
275 280 285 

Val Gin Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu 
290 295 300 

Ala Lys ; Tyr Gly Leu Asp Lys Asp Ala Ser Gly He Tyr Ser Val Asn 
305 310 31S 320 

He Asn Lys Phe Asn Asp lie Phe Lys Lys Leu Tvr Ser Phe Thr Glu 
325 330 * 335 

Phe Asp Leu Ala Thr Lys Phe Gin Val Lys Cys Arg Gin Thr Tyr He 
340 345 350 

Gly Gin Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser He 
355 360 365 

Tyr Asn He Ser Glu Gly Tyr Asn He Asn Asn Leu Lys Val Asn Phe 
3 7 0 375 380 

Arg Gly Gin Asn Ala Asn Leu Asn Pro Arg He lie Thr Pro He Thr 
385 390 395 400 

Gly Arg Gly Leu Val Lys Lye He He Arg Phe Cys Lys Asn lie Val 
405 410 415 

Ser Val Lys Gly He Arg Lys Ser He Cys He Glu He Asn Asn Gly 
420 425 430 

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn He 
435 440 445 

Asn Thr Pro Lys Glu He Asp Asp Thr Val Thr Ser Asn Asn Asn Tvr 
450 455 460 

Glu Asn Asp Leu Asp Gin Val He Leu Asn Phe Asn Ser Glu Ser Ala 
465 470 475 480 

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr He Gin Asn Asp Ala 
485 490 495 

Tyr He Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp He Glu Gin His 
500 sos 510 
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**" "** tS "* *~ " ■ »« * Leu Asp g. oa „ Ly , „ 

« gj «. «s„ ^ ^ ^ s „ sf ■ mp 

J- g. II. ryr T*r ^ jj. s «° s . r Mu phe 

— A5 „ g. „„ V>1 oln Ali ^ Phe w ser Trp £ 

575 

Gin Gin Val Leu Val Asp Phe Thr Thr 

580 P Thr ™5 Glu a Asn Gin Lys Ser Thr 

" 590 
«! ~P g. .1, «. »,„ „. f „ u . yal ^ ^ 

«, Jjo *,„ „. Cln tys Gly 2 • " 

62 0 

£»eu Glu Leu Leu Glv Ala riw T i ' v 

625 ^ 630 L6U L€U Glu Pro Glu Leu 

64 0 

Leu lie Pro Thr lie Leu Val Ph~ Thr- n * 

645 PhG THr L V S Ser Phe Leu Gly Ser 

Ser Asp Asn Asn Lys val ^ ftia ftsn ^ ^ ^ 

Ar g OXu L ys Trp Lys cxu V al Tyr ser phe ffi .™ 

Trp «ec Thr Lys Ile Asn ^ Gln ^ ^ ^ ^ ^ 

Tyr >!« Ala Leu Gln Asn Gln Val ftsn M , ^ ™ ^ ^ ^ ^ 
Ser L ys Tyr Asn ser Tyr Thr Leu Q l u ? lu ^ Asn Glu Leu Thr Z 

lvs ryr ASP xj. Lys cm Iie olu - Asn Glu Leu ^ 

XX. Ala Asn Asn Ile Asp phe ^ 21 ■ 

faU 76 5 

Tyr Leu Met Lys Leu lie Asn ri„ u^i r , 

770 X Iie ^ n Glu Vai Lys lie Asn Lys Leu Arg Glu 

Tyr Asp Glu Asn V^l Lvs Thr T Vr i t 

78 5 7 £o Vr LGU Leu lie Gln His 

800 

Gly Ser lie Leu Gly Glu Ser Gln Gln n„ , , 

805 f 1 " Leu Asn Ser Met Val Thr 

810 815 
A*P THr Leu A| n Asn Ser Xle Pro Phe Ly3 Leu Ser Ser ^ Thr ^ 

Asp Lys jje Leu Ile ser Tyr Asn Lyg ^ ^ 

845 

Ser ser Ser Val L eu Asn „ f , Arg Tyr Lys Asn ^ Lys ^ 
Thr ser Giy Tyr Asp Asn Ile Asn ^ ^ ^ 

„ 880 
Tyr .Pro Thr Asn Lys Asn Gln Phe Gly lie Tvr A^n a 

y ile x V r Asr > Asp L ys Leu ser 
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10 



15 



30 



35 



45 



885 890 



895 



Glu Val Asn He Ser Gin Asn Asp Tyr lie He Tyr Asp Asn Lys Tyr 
900 905 910 Y 

Lys Asn Phe Ser He Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 
915 920 925 

LyS Asn Val Asn Asn Glu Tyr Thr He He Asn Cys Met Arg 

935 940 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 
945 950 955 ^ 960 

Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
965 970 975 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
2() 980 985 990 

Val Thr He Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 
995 1000 10 05 

25 ° ly iti 0 LeU Ile Asp GIn tois Ser IiG LeU Asn io2o Gly ASn Ile HiS 

YSi«; Ser " ASP ASn IlG LGU Phe LyS Ile Val Asn C ^ s Ser T Y* Thr Arg 
1025 1030 1035 X xolo 

Tyr lie Gly He Arg Tyr Phe Asn lie Phe Asp Lys Glu Leu Asp Glu 
1045 1O50 X055 

Thr Glu lie Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn Ile Leu 
1060 1065 1070 

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 108 5 

40 tnSn Val LyS Pr ° ASn ASO Phe Ile As P Ar * Ar 9 L /s Asp Ser 

• 1090 1095 iioo 

Thr Leu Ser He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 
1105 1110 1H5 nSo 

Leu Tyr Ser Gly He Lys Val Lys Ile Gin Arg Val Asn Asn Ser Ser 
"25 1130 U35 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr Ile Asn Phe 
5 0 1140 H45 1150 

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr 
1155 H60 lies 

55 ???* LyS GiU LYS TKr Iie LyS Ile Ser Ser Ser Gl Y As « Arg Phe 

■ LA,U 1175 1180 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn 
1185 1190 H95 1200 

60 Phe Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala 

1205 1210 1215 

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His 
65 1220 1225 123 0 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe He Ser Glu Glu His Gly 
1235 1240 - y 



1245 



70 



Trp Gin Glu Lys 
1250 
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(2) INFORMATION FOR SEQ ID NO:S3 ; 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 1463 base pairs 
(B> TYPE : nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

<B> LOCATION: 108. .1460 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO-S3- 
ACSATCTCGAT CCCCCGAAAT TAArACGACT CACTATAGGG GAATTCTGAG CGGATAACAA 
rrCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ..ATACC ATG GGC CAT 



Met Gly His 
1 



:ss 21 «• - - ss q si m as s* 

S SI SI? S JE SS K 2 E S £ S SI SI £S "I 

30 3S 

s k e ;;i iii s»s m sj Arg iV, EE 

SIT K K S. S ™ K SI SI i 55 s» si s jss; 

™ si is si s: si in si si ?t i" *«' ™ <*» 

70 7" Gly As P v *l Tyr Lys Tyr Pro Thr 

£n ^ J£ "J *™ Tyr Asn ^ *~ ^ ™ 

B5 r 9 q iyr Asn As P Lys Leu Ser Glu VaJ Asn 

lie E §£ £J ™ ™ GAT AAT TAT ^ w ^ 

100 P X J S 1XG Ile Asp Asn Lys Tyr Lys Asn Phe 

Se' lH K SI" ™ ™ j"J AJT OCT AAC TAT GAT AAT AAG ATA GTA 
120 9 S Pr ° As P Asn Lys Il e Vai 

1-25 130 

£E -I J£ J£ 2X ?nT ?S J2 r ATG AGG GftT ^ AAT 

135 Y r Ile Asn C V S Met Arg Asp Asn Asn 

"° 145 
TCA GGA TGG AAA GTA Tf-T <— t~r ■> «. 

- «y tjp v,j si si si si aSi sk s in ss. is s 

SK SI aSI s ss k si ss & s s. je £ ;oi si si 

S SI Sj ?II III SI IS SI S. 5 HI S3 S f 2 



60 
116 

164 

212 

260 

3 08 

356 

404 

452 
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548 



596 



64 4 



692 
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ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 
Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn Gly Asn Leu 
; 200 205 210 

5 ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAli GTT AGT GAC 

He Asp Lys Lys Ser He Leu Asn Leu Gly Asn lie His Val Ser Asp 
215 220 225 

AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 
■IU Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg Tyr He Glv 

220 ->-jc . 7 



40 



_ ; ~ ^ ^ 1 ^-n im tr^i AAT AGA TTA TAT AGT 

He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg Leu Tyr Ser 
310 315 320 

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 
Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser Thr Asn Asp 
325 330 335 ^ 

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 
Asn Leu Val Arg Lys Asn Asp Gin Val Tyr He Asn Phe Val Ala Ser 
340 345 350 355 

AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 
Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lys 
3 60 365 370 

4> GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 

Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arg. Phe Asn Gin Val 
375 380 385 

GTA GTT ATG AAT TCA . GTA GGA AAT TGT ACA ATG AAT TTT AAA AAT AAT 
Val Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe Lys Asn Asn 
3 90 395 400 

AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT GTA GTT 
. Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lys Ala Asp Thr Val Val 

4 °5 410 415 

GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC AGC AAT 
Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn Ser Asn 
420 425 430 435 

GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA GAA AAA 
Gly Phe Phe Trp Asn Phe He Ser Glu Glu His Gly Trp Gin Glu Lys 

440 AAC „ r- ^ 



"5 TAA 

(2; INFORMATION FOR SEQ ID NO: 54 



740 



788 



30 235 240 

ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 
He Arg Tyr Phe Asn lie Phe Asp Lys Glu Leu Asp Glu Thr Glu He 
{? 2 « 250 255 

CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA AAG GAT TTT 
Gin Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn lie Leu Lys Asp Phe 
20 260 265 270 2 75 

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 
Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val 
280 285 290 

TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT ACT TTA AGC 
Leu Lys Pro Asn Asn Phe He Asn Arg Arg Thr Asp Ser Thr Leu Ser 
295 300 305 

in tT 7 ^ T ATA AGA AGC ACT ATT CTT "A GCT AAT AGA TTA TAT AGT 1076 

He Asn Asn He Arg Ser Thr He Leu Leu Ala Asn Arg 
310 315 320 



836 



884 



932 



980 



1028 



1124 



1172 



1220 



1268 



1316 



1364 



1412 



1460 



1463 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acids 
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(B) TYPE: amino acid 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO 54- 

- Oly „ is Hi. «. His „i s „ is „ ls His His His Ser Ser H i8 

xi. mu cx y Afg „ is Met Ala ser M| , Ala Leu ser ser T ^ r T - r 5 Asp 

, 5 Asp Lys Ile Leu Ile ser ^ ph ^ ^ ^ ^ j° ^ 

4 5 

S-r S.r Sei - v, a .eu ^ , y , A „ ^ ^ ^ ^ _ 

^ s « oly Tyr „„ ,„ A<n ne Asn tl . ^ ;° ssp v>i Tyr ^ 

80 

Tyr Pro Thr Asn Lys Asn Gin Phe Gly H e Tvr A<*n a r 

85 y on y As n Asp Lys Leu Ser 

95 

Olu. V.1 Asn ^ Ser Gln Asn Asp ^ ^ ne ^ ^ ^ ^ 

LySASn us S " -P -X Arg XXe Pro A sn Asp Asn 

XXe VaX Asn vai A sn Glu Tyr Thr „. Ile ^ ^ " 

5 140 - 

Asp Asn Asn Ser Glv Tm t v a i ^ 

145 Gly L ' s Val s *r Leu Asn His Asn Glu He He 

155 160 
Trp Thr Leu Gin Asp Asn Ser Glv lie A-n n„ T 

165 Y iS *™ Gln h Y s Leu Ala Phe Asn 

° 175 
TV* CX V Asn . jj ; Asn Qly Ue ser ^ ^ Asn Ly£j ^ ne ^ 

45 V * 1 ^ JJ- Thr A sn ASP Arg Leu Gly Asp Ser Lys ^ ^ Ile Asn 

Cly AS n Leu Ile Asp Ly . Ly| ser Ile Leu ^ ^ ^ ^ 

VaX ser Asp Asn Ile Lfu phe Lys rl& ^ ^ ^ ^ ^ ^ 

240 

TV, He Gly XXe Arf Tyr Phe Asn Ile ^ ftsp ^ ^ 
Thr CXu XXe OXn Thr Le u Tyr Aon A| „ Glu Pro A . n Al- ^ ^ 
Ly. Asp P.e Trp Gly Asn Tyr ^ ^ ftsp ^ ™ ^ 

Leu A| „ vaX Leu Lys Pro A| n Asn Phe Ile Asn ^ ^ ^ ^ ^ 
Tjr Leu Ser XXe As „ As „ Ile Arg Ser Thr ^ ^ ^ 

" ^ lit ^ S V - <-» ^ Arg VaX As „ Asn Ser 3" 



20 



25 



30 



35 



40 



50 



35 



60 



65 



- 326 - 



WO 98/08540 



PCT/US97/15394 



thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr lie Asn Phe 
340 345 350 

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr 
355 360 365 

Thr Asn Lys Glu Lys Thr He Lys He Ser Ser Ser Gly Asn Arq Phe 
370 375 380 

Asn Gin Val Val Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe 
385 3 90 395 400 

Lys Asn Asn Asn Gly Asn Asn He Gly Leu Leu Gly Phe Lvs Ala Asd 
405 410 415 

Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr 
420 425 430 

Asn Ser Asn Gly Phe Phe Trp Asn Phe He Ser Glu Glu His Gly Tro 
^35 440 445 

Gin Glu Lys 
450 



10 



15 



30 



40 



45 



60 



65 



(':>) INFORMATION FOR SEQ ID NO: 55: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ~ "DNA" 

( ix> FEATURE : 

<A> NAME /KEY : CDS 

(B) LOCATION: 108 .. 146.3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 16 4 

His His His His His His His His His Ser Ser Glv His He Glu Glv 
5 10 15 



CAT ATC GCT AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 212 
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys lie 
20 2 5 30 35 

TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA ACT AGT TCA 260 
Leu He Ser Tyr Phe Asn Lys Phe Phe Lys Arg He Lvs Ser Ser Ser 
40 45 ' 50 

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 
Val Leu Asn Met. Arg Tyr Lys Asn Asp Lvs Tyr Val Asp Thr Ser G 1 y 
55 60 65 

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 
Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys Tyr Pro Thr 
70 75 fio 

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 4 04 

Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser Glu Val Asn 
/u 85 90 95 



30B 



356 
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980 
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1076 



1124 
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37 * 380 



385 



SIT K. 52 S KMSJ £ 25 iS .5? iij32 & 21 
i2 K SS ff2K E? 2S5 S K K 2J £ «f 

410 415 . 

"IT ffT c GT ^ TGG TAT TAT ACA CAT A ^ AGA GAT CAT ACA AAC AGC 
Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Hil Thr Ser 



425 4^ 0 

430 435 



15 



He Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp 

25 30 

Asp Lys lie Leu He Ser Tyr Phe Asa Lys Phe Phe Lys Arg He L ys 
*> 3 4 0 45 1 

Ser Ser Sar Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp 

55 60 
Thr ser Glv tyr Asp Ser Asn lie Asn He Asn Glv Asp Val Tyr Lvs 

75 80 
Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 
85 90 95 

Glu Val Asn lie Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr 
100 105 110 

Lys Asn Phe Ser lie Ser Phe Trp Val Arg He Pro Asn Tyr Asp Asn 

125 

Lys lie Val Asn Val Asn Ann Glu Tyr Thr He lie Asn Cys Met Arg 

135 14 0 

Asp Asn Asn ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 

155 160 
Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 
165 170 175 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 

185 190 

Val Thr lie Thr Asa Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 



1316 



1364 



1412 



1460 



22 SJ & s SI £ e as sk. ss s; S? Sf. sr- 

445 450 
AAA TAAAAGCTT 

Lys 1472 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 4 52 amino acids 
<B) TYPE: amino acid 
<D) TOPOLOGY: iinear 

<ii> MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION : SEQ ID NO: 56: 

Met Gly His. His His His His His His His His His Ser Ser Gly His 

. 5 10 - - 



205 



™*I/<»S40 PCT/US97/1S394 

GlY 21S " e ASP Gln j£ Ser II* Leu Asn Leu Gly Asn Ile His 

220 

val uer Asp Asn Ile Lf u P ne Lys He v,l Asn c ys Ser Tyr Thr Arg 

235 240 

Tyr lie Gly He Arg Tyr Phe Asn He Phe Asn Lvs n„ r 

24 5 !T* !f L y s Glu Leu Asp Glu 

250 ' 2 55 

Thr Glu Ile gs Thr Leu Tyr ser ^ ^ ^ ^ lie Leu 

L y S As P p ; . Trp Gly Asn Tyr L _ Leu Tyr Asp Lys G ^ r 7 y ° r ^ ^ 

l-«}.n Val Leu Lys Pr« Asn Asn Phe lie Asp Ar 9 ^ Ly5 Asp Ser 

* 300 
Thr Leu Ser Ile Asn Asn Ile ftrg ^ ^ ^ ^ ^ ^ ^ 

UU Si* ^ ?}» ^ V,l Asn Asn Ser ^ 

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gin Val Tyr Phe 

Val Ala ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Z Ala xhr 

360 365 
Thr Asn Lys Glu Lys Thr H^iv^ti^c? 

370 Y 37s Y Ser SGr G1 V Asn Arg Phe 

J/b 3 80 

Asn Gin Val Val Val „ Asn Ser y-1 . oly ^ ^ ^ 

395 400 

Phe Lys Asn Asn Asn Gly Asn Asn lie Gly Leu Leu Gly Phe Lys Ala 

410 4 15 

Asp Thr val val Ala Ser Thr Trp Tyr Tyr Thr H iS Met ^ Asp HxS 

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe lie Ser Glu Gil His Glv 

445 



Trp Gin Glu Lys 
450 



(2) INFORMATION FOR SEQ ID NO; 57 : 

( i ) SEQUENCE CHARACTERISTICS - 
{A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(XI) SEQUENCE DESCRIPTION : SEQ ID NO: 57 

CGCCATGGCT CTTTCTTCTT ATACAGATGA T 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 29 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



- 330 - 



WO 98/08540 



PCT/US97/15394 



10 



20 



60 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCAAGCTTTT ATTTTTCTTG CCATCCATG 

(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3876 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS : double 
I - ( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



29 



48 



96 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..3873 

(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

tj? p CA ^T A nZ A A T T AAC ^ TTT ^ T ™T TCA GAT CCT GTT GAT AAT 
Met Pro He Thr He Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn 
' 5 10 

t7 T 7 TA T TA GAT ACT ° AT TTA -AAT ACA CTA GCT AAT GAG 

M} L>s Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu 

CCT GAA AAA GCC TTT CGC ATT ACA GGA AAT ATA TGG GTA ATA CCT GAT 
Pro Glu. Lys Ala Phe Arg He Thr Gly Asn He Trp Val He Pro £J 

; v s 35 40 45 

AGA TTT TCA AGA AAT TCT AAT CCA AAT TTA AAT AAA CCT CCT CGA GTT 
Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys 9ra Pro Arg VaT 

55 60 

40 A ^ A AGC CCT AAA AGT GGT TAT TAT GAT CCT AAT TAT TTG ACT ACT GAT 

Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr leu Ser TnT AsJ 

70 75 80 

TCT GAC AAA GAT ACA TTT TTA AAA GAA ATT ATA AAG TTA TTT AAA AGA 
Ser Asp Lys Asp Thr Phe Leu Lys Glu He He Lvs Leu Phe Ly^-ArJ 

tTL ^ I CT G ^ ATA GGA GAA GAA TT A ATA TAT AGA CTT TCG ACA 

He Asn Ser Arg Glu He Gly Glu Glu Leu He Tyr Arg Leu Ser Thr 
100 105 il0 

GAT ATA CCC TTT CCT GGG AAT AAC AAT ACT CCA ATT AAT ACT TTT cut 
Asp He Pro Phe Pro Gly Asn Asn Asn Thr Pro lie £J Thr Pnl AsJ 

55 115 120 125 

TTT GAT GTA GAT TTT AAC AGT GTT GAT GTT AAA ACT AGA CAA GGT AAC 
Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr ArJ G^ G^y 

13 5 140 

AAC TGG GTT AAA ACT GGT AGC ATA AAT CCT AGT GTT ATA ATA ACT GGA 
Asn Trp Val Lys Thr Gly Ser He Asn Pro Ser Val He He £hr Gly 

150 155 16u 

6S ^ GA GAA ^ ATT ATA GAT CCA GAA ACT TCT ACG TTT AAA TTA ACT 

Pro Arg Glu Asn He lie Asp Pro Glu Thr Ser Thr Phe Lys ™" Thr 

165 17 <> * 175 
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288 



336 



384 



432 
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180 yjy Pfte-Gly. Ala Leu Ser lie lie 

• 190 

TCA ATA TCA CCT AGA TTT ATG PTfl no* 

■"• fS£ p " - - K a.S E E ES s e e 
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345 3 50 

.Si 21 E Si S SI S Si E E 55 SI E ?K E E 

Jb0 365 
GTA CAA AAT AGG AAA ATA TAT CTT ma An ^ ^ 

Val Gin Asn Arg Lys lie Tyr ZIu Ser EI S*? ACT ° CG GTT ACQ 

370 r Ueu Ser Asn v al Tyr Thr Pro Val Thr 

J/ => 380 

2! »E K E e E E SIT E E S Si E E E E 

395 400 

E S E E E 25 E SI? 2i E E K Si E r» TrT 

405 c Gln Asn Leu Ser 

S ^ S Z£ ^ ^ ™ CCT «A ATG CTT Z TTA 

420 Y ^25 LeU Tyr Leu 

IF. K E E E E E S E E E r I™ ™ T " ~» 

435 Y P G1 y- Ar 9 Ser Leu Tyr Asn 

H * u 445 

AAA ACA TTA GAT TGT AGA GAG CTT TTA r-r-r A ^ 

Lys r hc L . u Asp c , s Arg „„ err ttj „ «j ^ «j « ™ „ c 
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450 455 460 

TTT ATA GGT GAT ATT AGT GAT GTT AAA ACT GAT ATA TTT TTA AGA AAA 1440 
Phe lie Gly Asp lie Ser Asp Val Lys Thr Asp lie Phe Leu Arg Lys 
465 470 475 480 

GAT ATT AAT GAA GAA ACT GAA GTT ATA TAC TAT CCG GAC 'AAT GTT TCA i486 
Asp lie Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

GTA GAT CAA GTT ATT CTC AGT AAG AAT ACC TCA GAA CAT GGA CAA CTA 1536 
Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu His Gly GJ.n Leu 
500 505 510 

GAT TTA TTA TAC CCT AGT ATT GAC AGT GAG AGT GAA ATA TTA CCA GGG 15 84 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

GAG AAT CAA GTC TTT TAT GAT AAT AGA ACT CAA AAT GTT GAT TAT TTG 16 32 

Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 



AAT TCT TAT TAT TAC CTA GAA TCT CAA AAA CTA AGT GAT AAT GTT GAA 1680 
Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
2:> 545 550 555 560 



GAT TTT ACT TTT ACG AGA. TCA ATT GAG GAG GCT TTG GAT AAT AGT GCA 1728 
Asp Phe Thr Phe Thr Arg Ser lie Glu Glu Ala Leu Asp Asn Ser Ala 
565 570 575 

AAA GTA TAT ACT TAC TTT CCT ACA CTA GCT AAT AAA GTA AAT GCG GGT 1776 
Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 



GTT CAA GGT GGT TTA TTT TTA ATG TGG GCA AAT GAT GTA GTT GAA GAT 1824 
Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 



TTT ACT ACA AAT ATT CTA AGA AAA GAT ACA TTA GAT AAA ATA TCA GAT 18 7 2 

Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 



GTA TCA GCT ATT ATT CCC TAT ATA GGA CCC GCA TTA AAT ATA AGT AAT 192 0 

Val Ser Ala He He Pro Tyr lie Gly Pro Ala Leu Asn He Ser Asn 
4: > 625 630 635 640 



TCT GTA AGA AGA GGA AAT TTT ACT GAA GCA TTT GCA GTT ACT GGT GTA 1968 
Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
645 650 655 

ACT ATT TTA TTA GAA GCA TTT CCT GAA TTT ACA ATA CCT GCA CTT GGT 2016 
Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 
660 665 670 

GCA TTT GTG ATT TAT AGT AAG GTT CAA GAA AGA AAC GAG ATT ATT AAA 2064 
Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys 
675 680 685 

ACT ATA GAT AAT TGT TTA GAA CAA AGG ATT AAG AGA TGG AAA GAT TCA 2112 
Thr He Asp Asn Cys Leu Glu Gin Arg He Lvs Arg Trp Lys Asp Ser 
690 ' 695 700 



TAT GAA TGG ATG ATG GGA ACG TGG TTA TCC AGG ATT ATT ACT CAA TTT 216 0 

Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg He He Thr Gin Phe 
to "705 710 715 720 



70 



AAT AAT ATA AGT TAT CAA ATG TAT GAT TCT TTA AAT TAT CAG GCA GGT ">2 08 

Asn Asn lie Ser Tyr Gin Met Tyr Asp Ser Leu Asn Tyr Gin Ala Gly 
725 730 735 
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1010 1015 



1020 



3120 



3166 



3216 



AAT GGA AAA TTA ATA GAT ACT ATA AAA GTT AAA GAA CTA ACT GGA ATT 
Asn^ly Lys.Leu.Ile AsMhr He Lys Val Ly^Glu W Thr l" 

AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA GAT Arr 
Asn Phe ser Lys Thr^lle Thr Phe Glu ll^Asn iVe Asp Thr 

GGT TTG ATT ACT TCA GAT TCT GAT AAC ATC AAT ATfi Trr n-rn 
Oiy Ls„ o. TJr o S« *,p Ser «p Ajj^tlS SI I?? ^ 2J 

HI E^gS S !S Si xs iKS°j£.ls; 

JK ££ 2! Si ?;SS SI SI* £ £ ™W °S i£ 

GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 

Ti p os^ u Ar9 Tyr Asn j*n 0 Glu TYV Tyr ■ Met "Si s AM IX * ^ ^ 2u 

AAT AGA TAT ATG TAT GCG AAC TCA CGA CAA ATT GTT TTT AAT ACA CCT 
Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin lie Val £e £n Thr Arg 

1125 H30 i!35 

AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA An 
Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys lie lie lie Arg 
H40 1145 11SQ 

ATC AGA GGA AAT ACA AAT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 

• 19 ^ y 5 Asn Thr Asn Asp I^ 0 Ar9 val Ar 9 G1 v ^yp? ™ 

TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT a-f 
Tyr Phe Asp Met Thr He Asn Asn Lys Ala lyl £1 llu III ^ 

1175 1180 

£n G^ Thr mI? *? A * AT ° AT AGT ACT GAA GAT ATA TAT GCT 

^85 ^ ^on ASP ASO HiS Ser Thr Glu As P He Tyr Ala 

1X90 1195 j.200 

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 
.le Gly Leu Arg Glu Gin Thr Lys Asp He Asn Asp Asn III lie Phe 
05 121 ° 1215 

G^ lie Gin Pro m« ^ TAT TAT TAC GCA TCT CAA ATA TTT 

Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin He Phe 

1220 "25 1230 

AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TGT TCA ATA CfT 
Lys Ser Asn^Phe Asn Gly Glu AsnWe Ser Gly lie Cysjer lie G^y 

ACT TAT CGT TTT AGA CTT GGA GGT GAT TGG TAT AGA CAC AAT TAT rrr 
Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr A^g His Tvr EE 

1255 1260 

vll Pro Thr vll ^s GlLn £* T ^ GCT TCA ™ TTA GAA TCA ACA 

1265 V ?i?n Gly AS " Tyr Ala Ser Leu Leu Glu Ser Thr 

1270 1275 1280 

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAA 

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 3876 
1285 1290 
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(2) INFORMATION FOR SEQ ID NO : 60 : 

(i) SEQUENCE CHARACTERISTICS* 

{a! ^t™ 1 1291 ai «ino*acids 
(B) TYPE: amino acid 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
Met Pro lie Thr He A sn Asn Phe Asn Tyr Ser Asp Pro Val Asp AS n 

Lys Asn He Leu Tyr Leu Asp Thr His Leu Asn Thr Leu 

25 30 



2 - n ^s ^u Asn Thr Leu ftsn ^ 

Pro Glu Lys AXa Phe Ar g Ue Thr Gly Asn Ile Trp V; , pro ^ 

Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val 

b5 60 

Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp 
Ser Asp Lys Asp Thr Phe Leu Lys Glu lie He Lys Leu Phe Lys Arg 



95 



He Asn ser Arg Giu lie Gly Glu Glu Leu lie Tyr Arg Leu Ser Thr 

105 110 
Asp lie pro Phe Pro Gly Asn Asn Asn Thr Pro lie Asn Thr Phe Asp 

Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gin Gly Asn 

135 140 
Asn Trp val Lys Thr Gly Ser Ue Asn Pro Ser Val He He Thr Gly 

Pro Arg Glu Asn lie lie Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr 

170 175 

Asn Ash Thr Phe Ala Ala Gin Glu Gly Phe Gly Ala Lou Ser lie xle 

18 5 190 
Ser lie Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp 

205 

val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro He 

215 220 
Leu lie Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly 

235 240 
He Ala lie Pro Asn Asp Gin Thr lie Ser Ser Val Thr Ser Asn lie 

245 250 255 

Phe Tyr Ser Gin Tyr Asn Val Lys Leu Glu Tyr Ala Glu He Tyr Ala 

265 270 
Phe Gly Gly Pro Thr lie Asp Leu lie Pro Lys Ser Ala Arg Lys Tvr 

280 28 5 

Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser lie Ala Lys Arg Leu 

295 300 

Asn Ser He Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr lie Gly 

310 315 320 

- 336 
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Glu Tyr Lys Gin Lys Leu lie Arg Lys Tyr Arg Phe Val Val Glu Ser 
325 330 335 

Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn 
340 345 350 

Glu Leu Thr Gin lie Phe Thr Glu Phe Asn Tyr Ala Lys' He Tyr Asn 
355 360 365 

Val Gin Asn Arg Lys He Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr 
370 375 380 

Ala Asn He Leu Asp Asp Asn Val Tyr Asp He Gin Asn Gly Phe Asn 
385 390 395 400 

He Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gin Asn Leu Ser 
405 410 4 15 

Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 

Phe Thr Lys Phe Cys His Lys Ala He Asp Gly Arg Ser Leu Tyr Asn 
435 440 " 445 

Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro 
450 455 460 

Vhe He Gly Asp He Ser Asp Val Lys Thr Asp He Phe Leu Arg Lys 
465 470 475 480 

Asp He Asn Glu Glu Thr Glu Val lie Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

Val Asp Gin Val He Leu Ser Lys Asn Thr Ser Glu His Gly Gin Leu 
500 505 510 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 
530 535 540 

Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
545 550 555 , . 560 

Asp Phe Thr Phe Thr Arg Ser lie Glu Glu Ala Leu Asp Asn Ser Ala 
565 , 570 575 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 

Vol Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 

Phe Thr Thr Asn He Leu Arg Lys Asp Thr Leu Asp Lys He Ser Asp 
610 615 620 

Val Ser Ala He He Pro Tyr He Gly Pro Ala Leu Asn He Ser Asn 
625 630 635 640 

Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val 
645 650 655 

Thr He Leu Leu Glu Ala Phe Pro Glu Phe Thr He Pro Ala Leu Gly 
660 665 670 

Ala Phe Val He Tyr Ser Lys Val Gin Glu Arg Asn Glu He He Lys 
675 680 685 

Thr He Asp Asn Cys Leu Glu Gin Arg He Lys Arg Trp Lys Asp Ser 
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690 69 ' 1 

Tyr Glu Trp Met Met Glv Thr Tm t~., «=■ . 

70S *H Thr Tr P ^ Ser Arg He He Thr Gin Phe 

715 .720 
" n ^ " e SSr ^ Gln Met Tyr Asp Ser X.eu Asn Tyr G1 „ Ala Gly 
Ala Xle , ys Ala Lys rle Asp ^ „ ^ Lys Lys ^ ^ 

" 750 
Asp ,y 5 Glu Asn lle Lys Ser ^ Val Q1(j ^ ^ ^ ^ ^ ^ 

Asp Val L yS xle Ser Glu ^. Met Asn ftsn u# ..^ ^ ^ 

Clu c ys Ser val Thr Ty r Leu PHe Lys Asn Mec L J Pro Lys yal ^ 

795 800 
Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr ai r 

805 P 9 ' ™ L ^ s Ala Lys Leu i ie Asn 

810 815 
Leu Xle Asp ser „ is Asn Ile Ile yal Qly ^ ^ 

825 830 
».V Ala jy. Val Asn Asn ser Phe cm Asn Thr Ile ^ phe ^ ^ 

Phe ser Tyr Thr ^ Asn g „ Leu ^ ^ ^ ^ ^ ^ ^ 

Phe Asn Asn He Asn Asp Ser Lys Ile Le U ser Zl Gln Aon Arg Lvs 

875 880 

Asn Thr Leu Val A<^r> Thr c^^ /^r ™ 

Val A-p Thr Ser Gly Tyr Asn.Ala Glu Val Ser Glu Glu 

890 895 

Gly Asp val Gin Leu Asn Pro ■ He Phe Prn ph. a nu 

900 PhS Asp Phe L V S Leu Gly 



910 



Ser ser Gly Glu Asp Arc, Gly ^ Val Ile Vfll Thr ^ 



Qon vjIu Asn 



Ile val ryr Asn Ser Met T yr Glu Ser Phe Ser lie Ser Phe Trp Xle 



Arg lie Asn Lys Trp Val Ser Asn Leu Pro Gl y T yr Thr He He Asp 

955 960 
Ser val Lys Asn A|n Ser Gly Trp ^ ^ ^ ^ ^ ^ ^ ^ 

UK ^ G1 " Si? *«P S« Glu Gin ser 12 Asn 



985 99Q 



Phe ser ryr Asp He Ser Asn prQ Gly ^ ^ ^ ^ ^ 

■ LUUO 1005 
Phe Va^Thr val Thr Asn Asn M et „ et Gl y Asn Met Lys xle T yr lie 

1020 

To2S GlY ^ A =r 0 Thr ^ S G1 " Le " Thr Oly XI 

103S 10 



e 
040 



Asn Phe Ser ,ys ThrUe Thr Phe Glu Xl^Asn Lys Xle Pro As^Tnr 
bly Mu Xle Thr^er Asp Ser Asp Asn lie Asn Met Trp He A^Asp 

1065 107Q 



10 



15 



20 



30 
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Phe Tyr lie Phe Ala Lys Glu Leu Asp Gly Lys Asp lie Asn lie Leu 
107S 1080 1085 

Phe Asn Ser Leu Gin Tyr Thr Asn Val Val Lys Asp Tyr Trp Gly Asn 
1090 1095 1100 t 

Asp Leu Arg Tyr Asn Lys Glu Tyr Tvr Met Val Asn lie Asp Tyr Leu 
"'°5 1110 ins 1120 

Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gin He Val Phe Asn Thr Arg 
1125 1130 1135 

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys He He He Lys Arg 
H40 1145 iiso 

He Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp He Leu 
H55 1160 lies 

Tyr Phe Asp Met Thr lie Asn Asn Lys Ala Tyr Asn Leu Phe Met Lys 
H70 1175 H80 

Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp He Tyr Ala 
1185 1190 H95 1200 

lie Gly Leu Arg Glu Gin Thr Lys Asp He Asn Asp Asn He He Phe 
1205 1210 1215 

Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin Tie Phe 
1220 1225 1230 

Lys Ser Asn Phe Asn Gly Glu Asn He Ser Gly lie Cys Ser He Gly 
1235 1240 1245 



Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 
j: > 1250 1255 1260 

Val Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu Glu Ser Thr 
1265 1270 1275 1280 

40 Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 

1285 1290 

(2) INFORMATION FOR SEQ ID NO; 61: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



50 



60 



65 



70 



(ii> MOLECULE TYPE: DNA (genomic) 



<ix> FEATURE: 

(A) NAME /KEY : CDS 
^ <B> LOCATION: 108.. 1493 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 1 : 

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 6 0 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT u6 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164 
His His His His His His His His His Ser Ser Gly His He Glu Glv 
5 10 15 Y 

CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 212 
Arg His Met Ala Ser Met Ala Leu Leu Lys Asp He He Asn Glu Tyr 
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2° 25 

30 35 



: s J2 is z sj is »j s ^ ^ 

JS S 25 S5 I£ ?S S SR .?£ si s §r ?? « <=» ~ 

55 y *H Asn Ala Glu Val Ser Glu Glu 

S 5 STS E S S S SI S SI sVsi £ a - 
S S 3? SK SJ 25 s S SIT JK ss s « - «; - 
ni s? & SI E £ ™ sj - E £ . J s s ™ 

S ^ SI JS s & K IS S; ™ ™ £ ™ f K 

s sir ta % B ss £ - - - - Z .„ 

™ - = - SJ ^ SK 25 SS g s ffi « 

: s i - - - - £ £ ■ s ss ~ 5 « - si 

S 51? -S 21 S Z£ SI ffi «■ - si ™ iS ffi ?» JR 

"I SJ ™ ™ -I «J «J "J OTT ™ a„ „ A ACT «* ^ 

200 y " L l" 01u Thr Oly II. 

210 

AAT TTT AGC AAA ACT ATA ACA ttt rA» ^. 

«. ». £ . r jj. T „ r „: s si a; ts is k ,n s « 

SS 25 SJ.S S SI SI SI S - si s ™ ™ ™ 

ts % s: si s ts sk s: si si k - S - - ™ 

£1 "I S 22 S5 "I S Si ;ii »* «* ™ ra „ T 

26 0 2 > 5 r Asn Val Val Lys Asp Tyr Trp Gly Asn 

t" sf js ?s si si sk ™i «j «■ s t ?i a » t ™ ™' 

280 * 71 Tyr Val Asn He Asp T yr Leu 

285 290 

£ 5? S ?K S 2=E IS s s; 51 - SI SI ?s s 



260 



308 



356 



404 



452 



500 



54 8 



596 



644 



692 



740 



786 



836 



884 



932 



980 



1028 
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AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA AGA 

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys lie He He Lys Arg 
310 315 32Q 

tT° k GA S? A AAT ACA ' AAT GAT ACT AGA OTA CGA GGA GGA GAT ATT TTA 

lie Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp lie Leu 

3 30 335 



40 



50 



60 



65 



70 



ii) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 462 amino acids 
4 - (B) TYPE: amino acid 

(D) TOPOLOGY: linear 





<ii> 


MOLECULE 


TYPE: protein 


















( 


::i) 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


62 : 










Met 
1 


Gly 


His 


His 


His 
5 


His His His 


His 


His 
10 


His 


His 


Ser 


Ser 


Gly His 
15 


He 


Glu 


Gly 


Arg 

20 


His 


Met Ala Ser 


Met 

25 


Ala 


Leu 


Leu 


Lys 


Asp 

30 


He 


lie 


Asn 


Glu 


Tyr 
35 


Phe 


Asn 


Asn He Asn 
40 


Asp 


Ser 


Lys 


He 


Leu 
45 


Ser 


Leu 


Gin 


Asn 


Arq 
50 


Lys 


Asn 


Thr 


Leu Val Asp 
55 


Thr 


Ser 


Gly 


Tyr 
60 


Asn 


Ala 


Glu 


Val 


Ser 

65 


Glu 


Glu 


Gly 


Asp 


Val Gin Leu 
70 


Asn 


Pro 


He 
75 


Phe 


Pro 


Phe 


Asp 


Phe 
80 


Lys 


Leu 


Gly 


Sei- 


Sei- 
SS 


Gly Glu Asp 


Arg 


Gly Lys 
90 


Val 


He 


Val 


Thr 
95 


Gin 


Asn 


Glu 


Asn 


Ile 


Val 


Tyr Asn Ser 


Met 


Tyr 


Glu 


Ser 


Phe 


ser 


He 


Ser 



1076 



1124 



Wl P T I7* T GAT ATG.ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT ATG AAG i n- 

Tyr Phe Asp Met Thr He Asn Asn Lys Ala Tyr Asn Leu Phe Met £y*s 
30 345 350 3 55 

AAT GAA ACT ATG TAT GCA GAT AAT CAT AGT ACT GAA GAT ATA TAT GCT 
|5 Msn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp lie Tyr Ala 

360 365 37 0 

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 
He Gly Leu Arg Glu Gin Thr Lys Asp lie Asn Asp Asn lie He 
2() 475 380 385 

^ >T A ^ ^ TG ^ T ACT TAT TAT TAC GCA TCT CAA ATA TTT 

Gin He Gin Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gin He Phe 
390 395 400 



1220 



1268 



1316 



1364 



AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TGT TCA ATA GGT 
Lys Ser Asn Phe Asn Gly Glu Asn lie Ser Gly He Cvs Ser He Gly 
405 410 415 " . 

Thr T»* IT 1 ^ GA ? TT G ? A GGT GAT TGG TAT AGA CA C AAT TAT TTG 1412 

Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu 
4aU 425 430 435 

GTG CCT ACT GTG AAG CAA GGA AAT TAT GCT TCA TTA TTA GAA TCA ACA 
✓al Pro Thr Val Lys Gin Gly Asn Tyr Ala Ser Leu Leu Glu Spi- Thr 
440 445 450 

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAAAAGCTT 
Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
455 460 

(2.) INFORMATION FOR SEQ ID NO : 6 2 : 



1460 



15Q2 
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100 



10S 



Phe Trp xle Arg Iie Asn Lys x val ser Asn Leu prQ Gly T ^ 

1ZU 125 
He lie Asp ser Val Lys Asn Asn Ser Gly Trp Ser He G .y i le Ile 

Ub 140 



Ser Asn Phe Uu vi g Thr Leu Lys Gln AsR ^ ^ ^ 

155 160 
Ser Xle Asn Phe ser Tyr Asp Xle Ser Asn Asn >la P ro Gly Tyr Asn 

LX. Trp Phe Phe val Thr val Thr ^ ^ Asn Met Lys 

He Tyr Xle Asn Gly Lys Le U Xle Asp Thr xle Lys Val C G lu ^ 

^uu 205 
Thr Gly xie Asn Phe Ser Lys Thr Xle Thr Phe Glu Xle Asn Lys xle 

Pro Asp Thr Gly Leu Xle Thr Ser Asp Ser Asp Asn xle Asn Met Trp 

He Arg Asp Phe Tyr Xle Phe Ala Lys Glu Leu Asp Gly Lys Asp ^ 

Asn lie Leu Phe Asn Ser Leu Gin Tyr Thr Asn Val val Lys Asp Tyr 

Trp Gly Asn Asp Leu Arg Tyr A=„ Lys Glu Tyr Tyr Met Val Asn lie 

280 285 

Asp Tyr Leu Asn Arq Tvr Mer Tvr * i * 

290 9 yr Tyr Ala Asn Ser Arg Gin U e Val Phe 

. 300 . 

305 Thl " ^ Al ' 9 A5n £S AS " AS P Phe Asn Glu Gly Tyr Lys xle Xle 
He Lys Arg lie Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly 
Asp Xle Leu Tyr Phe Asp Met Thr Xle Asn Asn Lys Ala Tyr ^ Leu 
Phe Met Lys Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp 

He Tyr Ala Xle Gly Leu Arg Glu Gin Thr Lys Asp xj Asn Asp Asn 

J75 380 
lie Xle Phe Gin Xle Gin Pro Met Asn Asn Tnr Tyr Tyr ^ ^ ^ 

395 400 
Gin lie Phe Lys Ser Asn Phe Asn Gly Glu Asn xle Ser Gly He C ys 

S,r Xle Gly Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Jyr l-g H is 

Asn Tyr Leu Val Pro Thr Val Lys Gin Gly Asn Tyr Ala lZ Leu Leu 

q *° 445 
Glu Ser Thr Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 

4b5 460 
<2) INFORMATION FOR SEQ ID NO: 63: 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = *' DN A M 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:63: 

CGCCATGGCT TTATTAAAAG ATATAATTAA TG 

i2) INFORMATION FOR SEQ ID NO: 64: 



iix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..382B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

4^ ATG ACA TGG CCA GTA AAA GAT TTT AAT TAT AGT GAT CCT GTT AAT GAC 

Met Thr Trp Pro Val Lys Asp Phe AsnTyr Ser Asp Pro Vai Asn Asp 
1 5 io 15 



32 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
-0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

-5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 

GCAAGCTTTT ATTC ACT T AC AGGTACAAAA CC 32 

(2) INFORMATION FOR SEQ ID NO : 65 : 

_>o 

( i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3831 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
^ (D) TOPOLOGY: linear 

iii) MOLECULE TYPE: DNA (genomic) 



46 



AAT GAT ATA TTA TAT TTA AGA ATA CCA CAA AAT AAG TTA ATT ACT ACA 96 

Asn Asp He Leu Tyr Leu Arg He Pro Gin Asn Lys Leu He Thr Thr 

20 25 30 

CCT GTA AAA GCT TTT ATG ATT ACT CAA AAT ATT TGG GTA ATA CCA GAA 144 

Pro Val Lys Ala Phe Met He Thr Gin Asn He Trp Val He Pro Glu 

3 5 4 0 4 5 

AGA TTT TCA TCA GAT ACT AAT CCA AGT TTA AGT AAA CCG CCC AGA CCT 192 

Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arq Pro 

50 55 60 

ACT TCA AAG TAT CAA AGT TAT TAT GAT CCT AGT TAT TTA TCT ACT GAT 24 0 

Thr Ser Lys Tyr Gin Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp 

65 70 75 80 

GAA CAA AAA GAT ACA TTT TTA AAA GGG ATT ATA AAA TTA TTT AAA AGA 2 88 

Glu Gin Lys Asp Thr Phe Leu Lys Gly He He Lys Leu Phe Lys Arg 

85 90 95 

ATT AAT GAA AGA GAT ATA GGA AAA AAA TTA ATA AAT TAT TTA GTA GTT 3 36 

lie Asn Glu Arg Asp He Gly Lys Lys Leu He Asn Tyr Leu Val Val 
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E ee e e s; 5ES ssfeag « 

*** as. - »" £ s e s: s ss ss ™ e sx e 

K SS K SJ .S E E E S E E 5 22 E - Sy 
E - E E E «; •* ™ « S E E ™' <» « 

ig5 P lyr Thr Ala Ser Leu Thr Leu Gin Gly 

§K SS EE Sfi s 2J S3 E sg S I™ TCT CT » 

180 Pne G1 y Thr Leu Ser lie Leu 

190 

AAA GTA GCA CCT GAA TTT TTG TTa a™ 

- «. ^ „„ EE E S EE EE s EE 

Si E E SI E ™ S£ EE EE E *™ <™ °» 

210 215 CyS Met As P Pr <> Val 

E E S E SI SE E SE'E ee e ss e 

e e as e e e ^ e e ™ e e a; S 

E E E EE SS EE E SK IS EE ™ ™ - 

260 ^ ne Crlu Leu Tyr Thr 

" 65 270 

e s e us e sk s a e sk e e is; e e" 

E E/E E £ E SI EE EE E E E SS 
E SI E E E S S E E E E E EE E E 

315 320 

AAA TAT AAA AAA ATA TTT t<~t R ^ m 

rr, E E E E E E E E E E E E 
E SS E E E Sit E E E E E E E. E 5 E 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1O08 
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t 

GAC TTG ACT AAT GTT -ATG TCA GAA GTT GTT TAT TCT TCG CAA TAT AAT 
Asp Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gin Tyr Asa 
355 360 365 

GTT AAA AAC AGG ACT CAT TAT TTT TCA AGG CAT TAT CTA CCT GTA TTT 
Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
370 375 3 8 o 



15 



20 



40 



50 



60 



65 



TTA ACA AAT AAA GGT TTT AAT ATA GAA AAT TCG GGT CAG AAT ATA GAA 
Leu Thr Asn Lys Gly Phe Asn lie Glu Asn Ser Gly Gin Asn lie Glu 
405 410 

AGG AAT CCT GCA CTA CAA AAG CTT AGT TCA GAA. AGT GTA GTA GAT TTA 
Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
420 ,425 43 0 

TTT ACA AAA GTA TGT TTA AGA TTA ACA AAA AAT AGT AG A GAT GAT TCA 
Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
435 440 445 

ACA TGT ATT AAA GTT AAA AAT AAT AGA TTA CCT TAT GTA GCT GAT AAA 
Thr Cys lie Lys Val Lys Ash Asn Arg Leu Pro Tyr Val Ala Asp Lys 
450 455 460 



ACT AAT GTA CAA AAT TAT TCA GAT AAT TTT TCA TTA GAT GAA TCT ATT 
Thr Asn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser lie 
4 85 4 90 4 95 

TTA GAT GGG CAA GTT CCT ATT AAT CCT GAA ATA GTA GAT CCA CTA TTA 
Leu Asp Gly Gin Val Pro He Asn Pro Glu He Val Asp Pro Leu Leu 
500 505 5io 

CCC AAT GTT AAT ATG GAA CCT TTA AAT CTT CCA GGT GAA GAA ATA GTA 
Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu lie Val 
515 520 525 

4^ TTT TAT GAT GAT ATT ACT AAA TAT GTT GAT TAT TTA AAT TCT TAT TAT 

Phe Tyr Asp Asp He Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr Tvr 
530 535 540 



580 585 590 

TTA TTC TTA AAT TGG GCG AAT GAA GTA GTT GAG GAT TTT ACT ACA AAT 
Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 
595 600 605 

ATT ATG AAG AAA GAT ACA TTG GAT AAA ATA TCA GAT GTA TCA GTA ATA 
He Met Lys Lys Asp Thr Leu Asp Lys He Ser Asp Val Ser Val He 
610 615 620 



1104 



1152 



GCA AAT ATA TTA GAT GAT AAT ATT TAT ACT ATA AGA GAT GGT TTT AAT 12 00 

l" Axa Asn He Leu Asp Asp Asn lie Tyr Thr He Arg Asp Gly Phe Asn 

385 390 395 400 



1248 



1296 



1344 



1392 



GAT AGC ATT TCA CAA GAA ATA TTT GAA AAT AAA ATT ATT ACA GAT GAG 1440 
Mf Asp Ser He Ser Gin Glu He Phe Glu Asn Lys He lie Thr Asp Glu 

465 470 475 480 



1488 



1536 



1584 



1632 



TAT TTG GAA TCT CAA AAA TTA AGT AAT AAT GTT GAA AAT ATT ACT CTT 16 8 0 

Tyr Leu Glu Ser Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 
545 550 555 560 



ACA ACT TCA GTT GAA GAA GCA TTA GGT TAT AGC AAT AAG ATA TAC ACA 1728 
Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys He Tyr Thr 
565 570 575 

TTT TTA CCT AGC TTA GCT GAA AAA GTG AAT AAA GGT GTT CAA GCA GGT 
Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gin Ala Glv 

CQn r-nr- 7 



1776 



1824 



1872 



ATT CCA TAT ATA GGA CCT GCC TTA AAT ATA GGA AAT TCA GCA TTA AGG 1920 
/U He Pro Tyr He Gly Pro Ala Leu Asn He Gly Asn Ser Ala Leu Arg 
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635 640 



650 655 

?£ K E SI Si -J ffi isf. - - - s . j» «; jjj 

6 85 

51 K Si §X S 21 ^ Arg Trp £ £ £ - « » ffi 

i s£5rgfi;ss£ ffiS r ss s: is 

"I SK fif ^ ^ K p s ?£ SS S £1 52 JE & S 

^ ^ Z2 Glu Tyr ST !" « «* 

740 7 y " G1/ Ser As P Glu Asn 

/4b 750 

ATA AAA AGT CAA GTT GAA AAT tta 

». l„ « ta ... «s $s ssss sj si SJ & «: 
■k sk s SI? £1 -ffi - - - - ~ I ™ ™ 

S ™ S 53. K : S2 S S S £ S S -s - S «j 

- - " - & - - - SS Zi £. is x » - 5 

- - - - SJSS© £ SS 2 55 S2 - 

S SK X JE. SK-iSS S ™ "J ™ «£ «J Tjr Z i„ «r. 

835 ro Pfte Asn Ile P^e Ser Tyr Thr 

2£ ™ ™ ^ Asp - IX* S A^ 2* ™ ™ - T AGT ATT 

850 y lie Ile Asn Tyr Phe Asn Ser lie 

" b 860 

£ js e. & ?n s « e ss as s: k - - ns ss 

SS S E S-^ K SI iSJ s? .» g sj jjj »j «j I" 

si si » s ^ s? si s m S s s s 5 si 

90S 91Q 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



544 



2592 



2640 



2688 
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40 
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AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT AGC GCT ATT TAT 
Lys lie lie Val Asn Leu Asn Asn Asn lie Leu Tyr Ser Ala lie Tvr 
915 920 925 

GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT AAA GAT TTA ACT 
Asn Ser Ser Val Ser Phe Trp lie Lys lie Ser Lys Asp Leu Thr 
930 935 940 ' 



ATT AGA GAT TTT AAT ATT TTT TCT AAA GAA TTA AGT AAT GAA GAT ATT 
lie Arg Asp Phe Asn lie Phe Ser Lys Glu Leu Ser Asn Glu Asp He 
1060 1065 



1070 



GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA AAT CCT ATT ACT 
Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro He Thr 
1125 H30 1135 

ATT AAA TCA GTA TCT GAT AAG AAT CCT TAT AGT AGA ATT TTA AAT GGA 
He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg He Leu Asn Gly 
1140 H45 1150 

GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG AAA TAT ATG ATA 
Asp Asn He He Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met lie 
1155 1160 H65 

ATA AGA GAT ACT GAT AC A ATA TAT GCA ACA CAA GGA GGA GAG TGT TCA 
He Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly Gly Glu Cys Ser 
1170 1175 1180 



2784 



2832 



AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA GAA CAA AAT TCT 2flfiO 
10 Asn Ser His Asn Glu Tyr Thr He He Asn Ser He Glu Gin Asn Ser 

945 950 955 960 

GGG TGG AAA TTA TGT "ATT AGG AAT GGC AAT ATA GAA TGG ATT TTA CAA 
Gly Trp Lys Leu Cys lie Arg Asn Gly Asn He Glu Trp He Leu Gin 
965 970 975 

GAT GTT AAT AGA AAG TAT AAA AGT TTA ATT TTT GAT TAT AGT GAA TCA 
Asp Val Asn Arg Lys Tyr Lys Ser Leu He Phe Asp Tyr Ser Glu Ser 
980 985 990 

TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT GTT ACT ATA ACT 
Leu Ser His Thr Gly Tyr Thr Asn Lys -Trp Phe Phe Val Thr He Thr 
995 1000 1005 

AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT GGA GAA TTA AAG 
Asn Asn lie Met Gly -Tyr Met Lys Leu Tyr He Asn Gly Glu Leu Lys 
x01 ° 1015 102 0 

-t\ S? G AGT CAA AAA ATT GAA GAT GAT GAG GTT AAG TTA GAT AAA ACC 

= Gln LYS Ile G1U ASp ^ U As P Glu Val L V S Leu A sp Lys Thr 

025 1030 1035 1040 

ATA GTA' TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT CAG ATG CTT TGG 
Ile Val Phe Gly lie Asp Glu Asn Ile Asp Glu Asn Gin Met Leu Trp 
10^5 1050 1055 



2928 



>976 



3024 



3072 



3120 



3168 



3216 



AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT ATT AAA GAT TAT 
Asn lie Val Tyr Glu Gly Gin lie Leu Arg Asn Val lie Lys Asp Tyr 
1° 7 5 10B0 1085 

4:> TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT ATT ATT AAT GAT 

Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr He lie Asn Asp 
1090 1095 HOO 

AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT GTA CTT GTA CTT 3 360 

M Asn Tyr He Asp Arg Tyr lie Ala Pro Glu Ser Asn Val Leu Val Leu 

1105 1110 1H5 H20 



3264 



33i; 



3408 



3456 



3504 



3551 



70 ^ ^ T TGT GTA TAT GCA TTA TTA CAG AGT AAT TTA GGT AAT TAT 3600 

70 Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tyr 
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'" s . ■ , 200 : 

1205 Ser Lys Asn L y s T y r 

i 1215 



51 £ ^ K 15 £ "J s SK IS JS SS K 2 S 

1225 1230 
GAT ATA TAT AAA CCT TGG AG A TTT TPT ttt * a* 

A.P ». jg Pro „ p „J K «» »J Aon S 55 S S »• 



1255 1260 

£rn "237 t7 TCT AGG GAT CCA GGA TGG GTA GAG TAA 

rrp WB Phe lie Ser Arg Asp Pro Gly Trp Val Glu 

1270 1275 
<2> INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 1276 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi> SEQUENCE DESCRIPTION: SEQ ID. NO: 6.6 : 
Met Thr Trp Pro Val Lys A S p Phe Asn Tyr Ser Asp Pro Val Asn Asp 

10 15 
As.n-.Asp lie L eu Tyr Leu Ar 9 He Pro Gin Asn Lys Leu He Thr Tnr 

25 30 
Pro Val Lys Ala Phe Met He Thr Gin Asn He Trp Val xle Pro Glu. 

0 4 5 

Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro 

bS . 60 
Thr ser Lys Tyr cln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp 

Glu Gin Lys Asp Thr Phe Leu Lys Gly He He Lys Leu Phe Lys Arg 

90 95 

He Asn Glu Arg Asp He Gly Lys Lys Leu He Asn Tyr Leu Val Val 

105 110 

Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp 

120 125 

Phe Thr Arg His Thr Thr Asn He Ala Val Glu Lys Phe Glu Asn Gly 

135 14Q 

J.r Trp Lys Val Thr Asn He lie Thr Pro Ser Val Leu He Phe Gly 



- - — - inj. i_,tr U uin 

165 170 175 



Gin Gin Ser Asn Pro Ser Phe Glu Giy Phe Gly Thr Leu Ser He Leu 

18S 190 

Lys val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn 
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195 200 205 

Gin Ser Ser Ala Val Leu Gly Lys Ser He Phe Cys Met Asp Pro Val 
210 215 220 

lie Ala Leu Met His Glu Leu Thr His Ser Leu His Gin Leu Tyr Glv 
- 25 230 2 35 * Y 24 8 

He Asn He Pro Ser Asp Lys Arg He. Arg Pro Gin Val Ser Glu Gly 
245 250 255 

Phe Phe Ser Gin Asp Gly Pro Asn Val Gin Phe Glu Glu Leu Tyr Thr 
260 265 270 

Phe Gly Gly Leu Asp Val Glu lie He Pro Gin He Glu Arg Ser Gin 
275 280 285 

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp He Ala Lys Arg Leu 
290 295 300 

Asn Asn He Asn Lys Thr He Pro Ser Ser Trp lie Ser Asn He Asd 
305 310 315 320 

Lys Tyr Lys Lys He Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn 
32 5 3 30 33 5 

Thr Gly Asn Phe Val Val Asn He Asp Lys Phe Asn Ser Leu Tvr Ser - 
340 345 350 ' 

Asp Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gin Tyr Asn 
355 360 365 

Val Lyh Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
370 3.75 380 

Ala Asn He Leu Asp Asp Asn lie Tyr Thr He Arg Asp Gly Phe Asn 
385 3 *0 395 400 

Leu Thr Asn Lys Gly Phe Asn He Glu Asn Ser Gly Gin Asn He Glu 
405 410 415 

Arg Asn Pro Ala Leu Gin Lys Leu Ser Ser Glu Ser Val Val Asp Leu 
420 425 



430 



Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp ser 
435 440 445 

Thr Cys He Lys Val Lys Asn Asn Arg Leu Pro Tvr Val Ala Asp Lys 
45 ° 455 460 

Asp Ser He Ser Gin Glu He Phe Glu Asn Lys He He Thr Asp Glu 
465 ,470 475 480 

Thr Asn Val Gin Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser He 
485 490 495 

Leu Asp Gly Gin Val Pro He Asn Pro Glu He Val Asp Pro Leu Leu 
500 505 ' 510 

Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu He Val 
515 520 525 

Phe Tyr Asp Asp He Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tvr Tvr 
530 535 540 " 

Tyr Leu Glu Ser Gin Lys Leu Ser Asn Asn Val Glu Asn He Thr Leu 
545 5S0 555 560 

Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lvs He Tyr Thr 
565 570 ' 575 
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^ -* tS ^ J*. ... <», v al Gin Ala Gly 

~ss v *- a? - - 1" i- 

«0 ^ >*= *»» ™ ^ «• «« J,:,. „" I1= 

«. — „. „ y ^ Al , L ,„ A , n iie ^ ™ ser ai> ^ u ^ 

° ly A " W "** SJ * "* ~ "X V,l Ala Leu I" 

=lu „ y Ph . ^ olu ph . Thc ^ ^ a . , ^ «• ^ 

~. I-, xi. ci„ oi„ Ar9 „„ ty , IU n . Ly , ^ 2 C l„ to 

^ J* 0 i» 0 i„ Arg vai jj. a, 9 TtP ^ „ ?;s ~ „„ Trp Mee 

-1 „, a,„ Trp ^ u j„ Arg „. TM ^ ^ H ^ ^ ^ 

TVr .In M« Tyr Asp S „ S « r Tyr A „ p ^ 2 

7 3 5 

^ He Asp W Glu Tyr Lys Lys ^ ^ Gly ^ 

XL" Ly- S.r oi„ V al Blu Asn L|u Lys Asn ^ ^ ? J" ^ ^ 

S " -o Ala AS " AS " »• A« iVb Phe He Ar § «« Cy S Ser VaX 

Thr Tyr Leu P he Lys ^ Met Leu ^ Lys ^ ™ ^ ^ ^ ^ 

Mr. ^ ASP Leu ffi Thr Lys Thr Glu iie 5 2 

815 

«» ».n 11, XI. 1^ v.! al/ olu ;;J ^ ^ ^ ^ ^ 

.'J,' ~ "« ~ ~ ~5 *" Z ryr Thr 

A.n j.„.s„ L . u ^ u Lyo 4;? Ile „ e ^ ^ ^ s ^ 

«J ... Lys „. j.„ s „ ^ mn Am Aia ^ u ^ 

- ' u S 880 

Asp Thr ser Gly Tyr Asn Ala Glu Val Aro Val rt„ » 

885 X A ^ Val GI V As P Asn. Val Gin 

890 895 

L-" 1 A,„ Thr „. Tyr Thr A „ A , p ^ ^ ^ o ^ ^ 

». «. V.1 A,„ L . u A .„ A,„ A.„ H. ^ s « r ^" „. ^ 

OXu Asn s.r ser v.l Se r PH f Trp Ile Lys u . Ser ^ 

J3 940 

Asn Ser His Asn Glu Tyr Thr Hp ti= * 

iyi lie He Asn Ser Ile Glu ^ ^ 
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3.^ 



45 



60 



960 



945 950 

Gly Trp .Lys Leu Cys He Arg Asn Gly Asn He Glu Trp He Leu Gin 
5 965 970 j 975 

Asp Val Asn Arg Lys Tyr Lys Ser Leu lie Phe Asp Tyr Ser Glu Ser 
980 985 99 0 

Leu Ser His. Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr He Thr 
995 1000 1005 

Asn Asn He Met Gly Tyr Met Lys Leu Tyr lie Asn Gly Glu Leu Lvs 
1010 1015 1020 

?i?. Ser Gln LyS Ile Glu As P Leu As P Glu Val I-ys Leu Asp Lys Thr 
1025 1030 1035 X 1040 

lie Val Phe Gly He Asp Glu Asn lie Asp Glu Asn Gin Met Leu Trp 
20 1045 1050 1055 

lie Arg Asp Phe Asn Ile Phe Ser Lys Glu Leu Ser Asn Glu Asp He 
1060 1065 1070 

Asn lie Val Tyr Glu Gly Gin He Leu Arg Asn Val He Lys Asp Tvr 
1075 1080 io85 

Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr lie He Asn Asp 
1090 1095 iioo 

^fnc Tyr Ile ASP Ar9 Tyr Ile Ala Pro Glu Ser As " Leu Val Leu 

1105 1110 1H5 ii 2 o 

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro He Thr 
1125 1130 1135 

lie Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg He Leu Asn Gly 
1^-40 1145 iiso 

Asp Asn He Ile Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met Ile 
1155 1160 lies 

Ile Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly Gly Glu Cys Ser 
H ? 0 ; 1175 iiso 

Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn Leu Gly Asn Tvr 
1185 H90 1195 1200 



Gly Ile Gly lie Phe Ser Ile Lys Asn He Val Ser Lys Asn Lys Tyr 
5() 120S 1210 12 is 

Cys Ser Gin lie Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala 
1220 1225 1230 

Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro 
1235 1240 1245 

Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
l 250 1255 126O 



Trp Lys Phe lie Ser Arg Asp Pro Gly Trp Val Glu 
1265 1270 1275 
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(2) INFORMATION FOR SEQ ID NO: 67: 

<i) SEQUENCE CHARACTERISTICS • 

<A) LENGTH: 146 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

Ui> MOLECULE TYPE : DNA (genomic) 

(ix> FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 108 . . 1460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
ACATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA G AAATAAT TT TGTTTAACTT TAAG AAG G AG ATATACC ATG GGC CAT 



Met Gly His 
1 



"I SI XI XI SI SI £j 21 SJ Sl iK s» - 

si xi is? si s? si? si si si si sj ™ - - « - 
ss is si is si 2; si Si ?s ss s? si ss si si « 

iil SI Si 2? SI S Si & Iil SI Si Si ™ J£ SI? Si 
S5 iil SIT 3i SI SI S iS ~ S SI « - « ™ 



75 80 



S Si Si SI Si iS S Sf SI 25 SI SI SI ill Si Iil 
. K SI iS SI X? SI SI SI SIT K IS ™ - - ffi » 

110 115 



iii SI Si S SI SI XI SI Si ?il ?X ili iS SI SI ?li 

Si Si si si ss ;?= si si i?i 5 s; si si is si 

140 145 



si is si a; si si si s si iii si si si is si si 

155 160 



Til SI Xi Si Si SI SI S oSi lil ili SI Si Sp IS SI 



SI S - S si SI tli il? £ Iil 5» Si SI lil ili SI 

/() 190 195 



60 
116 



260 



308 



3 56 



404 



452 



500 



546 



596 



644 



692 
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10 



50 



GGA GAA TTA AAG CAG AGT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG 74 0 

v^ly Glu Leu Lys Gin Ser Gin Lys He Glu Asp Leu Asp Glu Val Lys 
200 205 210 

TTA GAT AAA ACC ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT 78 fi 

Leu As P Thr He Val Phe Gly lie Asp Glu Asn lie Asp Glu Asn 

215 220 ' 225 

CAG ATG CTT TGG ATT AG A GAT TTT AAT ATT TTT TCT AAA GAA TTA AGT fll* 
Gin Met Leu Trp He Arg Asp Phe Asn lie Phe Ser Lys Glu Leu Se- 
230 235 240 



884 



932 



AAT GAA GAT ATT AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT 
Asn Glu Asp He Asn He Val Tyr Glu Gly Gin He Leu Arg Asn Val 
245 250 255 

ATT AAA GAT TAT TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT 
lie uys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tvr 
20 26S 270 275 

ATT ATT AAT GAT AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT 9flo 
He He Asn Asp Asn Tyr He Asp Arg Tyr He Ala Pro Glu Ser Asn 
280 285 290 

25 GTA CTT GTA CTT GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA 1028 

/al teu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly 
295 300 30S 

™ ■^ AT A T T ACT ATT TCA GTA TCT GAT AAT CCT TAT AGT AGA 1076 

M) Asn Pro He Thr He Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arq 

310 315 320 

ATT TTA AAT GGA GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG 1124 
ile Leu Asn Gly Asp Asn He He Leu His Met Leu Tyr Asn Ser Arq 
-° 325 330 335 * 

AAA TAT ATG ATA ATA AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA 11 72 

Lys Tyr Met Ile Ile Arg Asp Thr Asp Thr He Tyr Ala Thr Gin Gly 
4() 340 345 350 355 

GGA GAG TCT TCA CAA AAT TGT GTA TAT GCA TTA AAA TTA CAG AGT AAT 12 2 0 

Gly Glu Cys Ser Gin Asn Cys Val Tyr Ala Leu Lys Leu Gin Ser Asn 
360 365 370 

4> TTA GGT AAT TAT GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT GTA TCT 126 8 

Leu Gly Asn Tyr Gly Ile Gly He Phe Ser He Lys Asn lie Val Ser 
375 380 385 



AAA AAT AAA TAT TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA 1316 

Lys Asn Lys Tyr Cys Ser Gin He Phe Ser Ser Phe Arg Glu Asn Thr 
390 395 400 

ATG CTT CTA GCA GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT 13 64 

Met Leu Leu Ala Asp Ile Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn 
" 4 05 410 415 

GCA TAC ACG CCA GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA 1412 

Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser 

60 420 425 430 435 

ACT TCA TCT TTT TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG 146 0 

lhr Ser Seir p he Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 
440 445 450 

65 TAAAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 68: 



1469 



70 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 51 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68- ' 
Mot ci y His His His His His His His His His His Ser Ser Cl y His 

15 

g.« c ly A , ? „ is „ et M , s „ M „ au ^ ^ ^ ^ 

>""•**»» S " "* «~ - »« - L» „n 

4 5 

Asn L ys Lys ^ Ala Leu v ., Agp ^ ^ a ^ ^ 

A |f Val GX y Asp Asn Val Gin Leu Asn Thr Ile £ ^ ^ ^ ^ 
LXS Ser S " S Is r GlY Asp Lys Ile -x" Asn Leu Asn ^ £ 

He Leu Ty r ser Ala xle Tyr OXu Asn Ser Ser V aX ser Phe Trp IXe 
Lys lie se, Lys Asp Leu Thr ft 3„ ser ^ ^ 



125 



Asn se, Ue OXu oXn A sn Ser 0X y Trp Lys Leu ^ Jle Arg ^ ^ 

HS . I1C T ^ Si;-™" *«P ™ A.„ Arf Xyr Lys s . r 

He Phe Asp Tyr ; „ Glu ser Leu ^ t ^ t ^ ^ ^ 

Trp Pne P he vax Thr XXe Tnr A sn Asn XXe Me t Gly Tyr He, ^ Leu 

185 190 
Tyr IXe As. Gl y Glu Leu >ys Gin Ser Gin L ys xi e G i u Asp Leu ftsp 

Glu vax Lys Uu A sp Lys T?r Ile Val phe Qly ^ J" ^ " 

220 

22 5 Asn Gin Met Leu Tr P Xle Arg Asp ph| As „ ^ ^ ^ ^ 

Glu Leu Ser Asn ? 1« A s P XXe Asn XXe Val Tyr Clu oly Gin Xle ^ 
A, g Asn Val zxe L ys Asp Tyr Xrp Gly Asn pro ^ ^ ^ ^ ^ 

Glu Tyr Tyr Xle Xle A sn Asp A , n Tyl - Ile Asp ftrg ^ ^ ^ 

*-80 285 

Glu Ser Asn Val Leu Val Leu Val Ara Tv-r t> m a 

290 aA Ar ^ Pro Asp Arg Ser Lys Leu 

Tyr Thr aiy Asn Pro tie Th r Ile L ys ser Val Ser Asp Lys Asn Pro 

T y r Ser Arg tie Leu Asn Gly Asp Asn „ e ^ ^ ^ ^ ^ ^ 



330 335 
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Asn Ser Arg Lys Tyr Met He Ile .Arg Asp Thr Asp Thr He Tyr Ala 
340 345 350 

Thr Gin Gly Gly Glu Cys Ser Gin Asn Cys Val Tyr Ala/Leu Lys Leu 
.355 360 3651 

Gin Ser Asn Leu Gly Asn Tyr Gly He Gly lie Phe Ser He Lys Asn 
370 375 380 

lie Val ser Lys Asn Lys Tyr Cys Ser Gin lie Phe Ser Ser Phe Ar<? 
38S 390 395 400 

Glu Asn Thr Met Leu Leu Ala Asp lie Tyr Lys Pro Trp Arg Phe Ser 
405 410 415 

Phe Lys Asn Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lvs 
420 425 430 

Leu Leu Ser Thr Ser Ser Phe Trp Lys Phe He Ser Arg Asp Pro Glv 
4 35 440 445 

Trp Val Glu 
4 50 

(2) INFORMATION FOR SEQ ID NO: 69: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

( D) TOPOLOGY : 1 inea r 

<ii) MOLECULE TYPE: other nucleic acid 
<A> DESCRIPTION: /desc = "DNA" 

. <xi> SEQUENCE DESCRIPTION: SEQ ID NO : 6 9 : 

GCAAGCTTTT ACTCTACCCA TCCTGGATCC CT 

i2) INFORMATION FOR SEQ ID NO: 70: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3825 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
<B> LOCATION: 1..3822 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATG CCA GTT GCA ATA AAT AGT TTT AAT TAT AAT GAC CCT GTT AAT GAT 
Met Pro Val Ala He Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp 
1 5 10 is 

GAT ACA ATT TTA TAC ATG CAG ATA CCA TAT GAA GAA AAA AGT AAA AAA 
Asp Thr lie Leu Tyr Met Gin He Pro Tyr Glu Glu Lvs Ser Lys Lvs 
20 25 30 



AGA AAT ACA ATA GGA ACG AAT CCT AGT GAT TTT GAT CCA CCG GCT TCA 
Arg Asn Thr lie Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 
50 55 60 



32 



46 



ye 



TAT TAT AAA GCT TTT GAG ATT ATG CGT AAT GTT TGG ATA ATT CCT GAG 144 
Tyr Tyr Lys Ala Phe Glu lie Met Arg Asn Val Trp He He Pro Glu 
35 40 45 



192 
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. & »' !£ SS '« E E ** "J «j - JJT ^ TTA *C ** 

65 70 Y Asp Pro Asn Tyr Leu Thr Thr 

80 

GAT GCT GAA AAA GAT AGA TAT ttb _ 

«p «. «. , y , Ts AC3 E - E £ £ K E Si E E 
S E E E & ESS? E £ E E SK SK S E 
S S E EE E SS EE ffi g g E E SK E 
E K 511 EEs S E EE K E E EE E 
E SK E E E E S £. E E E E SS S S!J E 

^■•^ 160 
GAT ATA TTT GAA AGT TGT TGT TAC CCC rrr a nn „ 

*.p n. *» Glu f „ Cy , Cyo ^ =« £j gj jjj ™ ™ ccj . 

E E 511 E E E E JS E SS E s» ■»»■«* « « 

180 J y | Phe. Gly Ser lie Asn He 

2? S E E E SK E SKE E E r °" *" «» « 

195 Y 2^ lyr Thr Phe Asn Asp H e Ser Gly 

S?5 S E E E £ SK E E S S E 2 S K E 
EE E s: E E S S Ef£E E E E ||| 
2? 5" EE Si SK £ S SK -S ESK 2* S" m ™ 

24 5 xu vax Lys Gin Ala Pro Leu Met 

"° 255 

ATA GCC GAA AAA CCC ATA arr ptx 

He Ala Glu Lys p r £ J£ [™ «A TTT TTA ACC TTT GGA GGT 

260 ^ 265 Phe Gly Gl y 

as Asp Hu js ?n *n s s us r ^ ™ ™ — 

275 ^ ||J Ala Met Glu Lys H e Ty r 

E E E SI E E SK £ S E S S E E SK E 
E E E E S SK E E K E SK E !K E E E. 
SK E E E E'E E E E E E Sf AOT T " *™ «» 

325 7 " As P G1 V Se ^ Tyr Thr Val 

JJ0 335 
AAT GAA AAT AAA TTT AAT GAA ATT TAT AAA ™* 

Asn Glu Asn Lys Phe Asn GlU lH ^ " A J AT AGT "T ACA 

lyr Lys Lys Leu Tyr Ser Phe Thr 



240 
288 
336 
384 
432 
480 
528 
576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 
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340 34 5 350 

GAG AGT GAC TTA GCA AAT AAA TTT AAA GTA AAA TGT AGA AAT ACT TAT H04 
Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tvr 
355 360 36§ 

TTT ATT AAA TAT GAA TTT TTA AAA GTT CCA AAT TTG TTA GAT GAT GAT U52 
Phe lie Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
370 375 380 

ATT TAT ACT GTA TCA GAG GGG TTT AAT ATA GGT AAT TTA GCA GTA AAC 1200 
lie Tyr Thr Val Ser Glu Gly Phe Asn He Gly Asn Leu Ala Val Asn 
385 390 395 40 0 

AAT CGC GGA CAA AGT ATA AAG TTA AAT CCT AAA ATT ATT GAT TCC ATT 1248 
Asn Arg Gly Gin Ser He Lys Leu Asn Pro Lys He . He Asp Ser He 
405 410 415 . 

CCA GAT AAA GGT CTA GTA GAA AAG ATC GTT AAA TTT TGT AAG AGC GTT 12 96 

Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Val 
420 425 430 



ATT CCT AGA AAA GGT ACA AAG GCG CCA CCG CGA CTA TGC ATT AGA GTA 
lie Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys lie Arg Val 
435 440 445 



1344 



AAT AAT AGT GAG TTA TTT TTT GTA GCT TCA GAA AGT AGC TAT AAT GAA 13 92 

Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
450 455 460 

AAT GAT ATT AAT ACA CCT AAA GAA ATT GAC GAT ACA ACA AAT CTA AAT 14 4 0 

Asn Asp He Asn Thr Pro Lys Glu lie Asp Asp Thr Thr Asn Leu Asn 
465 470 475 480 

3:> AAT AAT TAT AGA AAT AAT TTA GAT GAA GTT ATT TTA GAT TAT AAT ACT i4 86 

Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 
485 490 495 



CAG ACA ATA CCT CAA ATA TCA AAT CGA ACA TTA AAT ACA CTT GTA CAA 1536 
Gin Thr He Pro Gin He Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
500 505 510 



GAC AAT AGT TAT GTG CCA AGA TAT GAT TCT AAT GGA ACA AGT GAA ATA 15 84 

Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
4;> 515 520 525 



GAG. GAA TAT GAT GTT GTT GAC TTT AAT GTA TTT TTC TAT TTA CAT GCA 16 32 

Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 
530 535 540 

CAA AAA GTG CCA GAA GGT GAA ACC AAT ATA AGT TTA ACT TCT TCA ATT 1680 
Gin Lys Val Pro Glu Gly Glu Thr Asn He Ser Leu Thr Ser Ser lie 
545 550 555 560 

55 GAT ACA GCA TTA TTA GAA GAA TCC AAA GAT ATA TTT TTT TCT TCA GAG 1728 

Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp He Phe Phe Ser Ser Glu 
565 570 575 



TTT ATC GAT ACT ATC AAT AAA CCT GTA AAT GCA GCA CTA TTT ATA GAT 17 7 6 

Phe He Asp Thr He Asn Lys Pro Val Asn Ala Ala Leu Phe He Asp 
580 585 590 



TGG ATA AGC AAA GTA ATA AGA GAT TTT ACC ACT GAA GCT ACA CAA AAA 1624 
Trp He Ser Lys Val lie Arg Asp Phe Thr Thr Glu Ala Thr Gin Lys 
0:> 595 600 605 



70 



AGT ACT GTT GAT AAG ATT GCA GAC ATA TCT TTA ATT GTA CCC TAT GTA 18 72 

Ser Thr Val Asp Lys He Ala Asp He Ser Leu He Val Pro Tyr Val 
610 615 620 
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5 SSE Si SS SSSj ™ K 25 S « - 5s « 
.» Si SI £ E S SS SS|? IS S?S « E S s 
,, 2J E g SS HI K HI HS S 5 S S HI HI SI 

ss ?s.si is SS S Hi S° HI SS •» £1 S S £ St 

20 " b 700 

i hi s s 9 ts hi s- si si hi hi s: ss 

* S£SS 31 S! S SS SI - - s . ~ 



730 73S 



K SS SI HS HI HI 31 ?S SI HI S! HI r i" " 

740 iV; P Giu Asn.Arg Leu 



74S 7S0 



- iX HI 31 HI HI SK SI HIS HI SS Asn 31 tS S 

» ' 2s si hi Sr. ?s si si s: ss s hi ss si 21 S ™ 

825 830 

vll tS Ser Thr TIu ^ ^ ^ ATT CCA TTT GAA CTT TCT TCA TAT 
55 31 afs ^ Leu : Aon Ser |er He Pro phe Glu Leu Ser Ser ?Jr 

840 845 

Thr S£ £J J£ J™ ™ A H "It T^r pT ^ AGA ™ TAT *»* 

850 ^ ass 7 PhS AS " Ar£f Leu T W Lys 

60 B " 860 

?H hs 2; is ei is si '2; si? s - «: si m hi si 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



35 SS SI SI 31 HI !S HI HI S SI Si SI SS HI HI HI 
- IS SI S S? HI HI ?TI SI HJ SI 51? S SI X E 

40 775 780 

S SI 31 SI SI? III SI "I HI SI S HI SSI HI SI 

? 95 000 



2400 



2448 



2496 



2S44 



2592 



2640 



875 880 
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ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT GGA AAC GTA 2688 
lie Asp lie Ser Gly Tyr Gly Ser Asn lie Ser He Asn Gly Asn Val 
885 890 89S 

5 TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT AAT ACT AGG 2736 

Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly lie Tyr Asn Ser Arg 
900 90S ' 910 

If) f TT ^ GT 2^ GT T WT ATA GCT CAA AAT AAT GAT ATT ATA TAC AAT AGT 2784 

H> Leu Ser.Glu Val Asn He Ala Gin Asn Asn Asp He He Tyr Asn Ser 

915 920 925 

AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT CCT AAA CAC 

IS ZXn Ile Ser Phe Tr P VaX Ar 9 Ile Pro His 

930 935 940 



20 



40 



50 



60 



TAC AAA CCT ATG AAT CAT AAT CGG GAA TAC ACT ATA ATA AAT TGT ATG 
Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr Ile He Asn Cys Met 
945 .950 955 960 

GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT GTT AGA GAT 
Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr Val Arg Asp 
96 5 970 975 

TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT AAG GAA AAT 
Cys Glu lie Ile Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 
980 9 85 990 



2 8 32 



2880 



2926 



2 976 



1(1 , y T AGG TAT GAA GAA CTT AAT AGG ATA TCT AAT TAT ATA AAT 3024 

Ml Leu Ile Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn Tyr lie Asn 

"5 1000 1005 

AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA GGC AAT TCT AGA 3072 
Lys Trp lie Phe Val Thr Ile Thr Asn Asn Arg Leu Gly Asn Ser Arq 
*° 1015 1020 

ATT TAC ATC AAT GGA AAT TTA ATA GTT GAA AAA TCA ATT TCG AAT TTA 

?itr Tyr Iie Asn Gly Asn Leu Iie Val Glu L V S Ser ri « s «r Asn Leu 
1025 1030 JL035 1O40 

GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT GTT GGT TGT 
Gly Asp lie His Val Ser Asp Asn Ile Leu Phe Lys Ile Val Gly Cys 
1045 _ 1050 105S 

4^ GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT TTT AAT ACG 

Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lys Val Phe Asn Thr 
1060 1065 1070 

GAA TTA GAT AAA ACA GAA ATT GAG ACT TTA TAT AGT AAT GAG CCA GAT 3 26 4 

^lu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn Glu Pro Asp 
1075 1080 1085 

CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA TAT AAT AAA 

Leu 1 
1100 



3120 



316 8 



3216 



3312 



^ Pr ° IlG Leu Lys Asn Tr P Gi V Asn Tyr Leu Leu Tyr Asn Lys 

1090 1095 ---- ' 7 

AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT ATT ACT CTG 
Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr He Thr Leu 
1105 1"0 iii 5 il20 

AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT ACT GAA GGC 
Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val Thr Glu Gly 
1125 H30 H35 

65 TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA GTC ATT ATA 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val He Ile 
H- 4 " 1145 1150 

AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT TTT GTT AGA 3 504 

/W Arg Lys Asn Gly Pro He Asp Ile Ser Asn Thr Asp Asn Phe Val Arg 
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1X55 



1160 



1165 



10 



15 



20 



J:> 



40 



45 



50 



:>:> 



60 



65 



70 



'K'-J2£15'S SI ^.K-JS'Sf "J f AT CGT «* «A GAA TAT 
1170 7 ii« Val Ar 9 G1 y Val Glu Tyr 

1175 1180 | y 

S.Z2.SI K JS S S-S « {2 » - s 
21 SI EES 5S K S5 K S'si K £ E £° 
S? J= SI £ S IK SI E SI SI 2g - siW 

ss? s s s; e a: e si si s sir s e ™ « 

A ^ 40 1245 

iS SS SI ffi 25- iS IS E X 2* SI 3? ~ ™ ™ 

A * 4:>5 1260 
TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TP A 
Serlie Ser LyS Glu Asn Gly Trp Lys G^u 
Afc6b 1270 

(2) INFORMATION FOR SEQ ID NO:7l; 

ii) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 1274 amino'acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) -MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71": 
Me, Pro Val Ala II. Asn Ser Phe ^ - ^ Asn ^ ^ ^ ■ 

Asp Thr lie Leu Tyr Met Gln Ile p„ ^ Glu ^ Lys ^ ^ 

TVr Tyr Lys Ala Phe Clu He Met Ar g Asn Val Trp lie 11^ Pro Olu 

Arg Asn Thr He Cly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser 

b5 60 
L |U Lys Asn Gly Ser Ser Ala Tyr T yr ^ p „ Agn Tyr ^ ^ ^ 

Asp Ala 61u Lys A| p Arg Tyr Leu Lyg ^ ^ ne ^ ^ £ 
Ar 9 lie Asn ser Asn Pro Ala Gl y Lys Val Leu Leu Gln ^ £ ^ 

Tyr Ala Lys Pro T yr Leu Gly Asn Asp His Thr Pro lie Asp Glu Phe 

iJO 125 
Ser Pro Val Thr Arg Thr Thr Ser Va! Asn He Lys Leu Ser Thr Asn 

iJb 140 
V.1 Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu oiy Ala Gly Pro 

155 160 



3 552 



3600 



3648 



36 96 



3744 



3792 



3825 
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Asp He Phe Glu ser Cys Cys Tyr Pro Val Arg Lys Leu He Asp Pro 
165 .170 . 175 

Asp Val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Glv Ser lie Asn He 
180 185 * 1 190 

Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp He ser Glv 
195 2O0 205 

Gly His Asn Ser Ser Thr Glu Ser Phe lie Ala Asp Pro Ala He Ser 
610 215 220 



Leu Ala His Glu Leu lie His Ala Leu His Gly Leu Tyr Gly Ala Arc 

" 5 230 235 24 5 

Gly Val Thr Tyr Glu Glu Thr He Glu Val Lys Gin Ala Pro Leu Met 

245 250 



255 



He Ala Glu Lys Pro He Arg Leu Glu Glu Phe Leu Thr Phe Gly Glv 
260 265 270 

Gin Asp Leu Asn He He Thr Ser Ala Met Lys Glu Lys lie Tyr Asn 
-75 280 285 

Asn Leu Leu Ala Asn Tyr Glu Lys He Ala Thr Arg Leu Ser Glu Val 
~ 90 295 3oo 

Asn Ser Ala Pro Pro Glu Tyr Asp lie Asn Glu Tyr Lys Asp Tyr Phe 
J ° J 310 315 320 

Gin Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Glv Ser Tyr Thr Val 
325 330 335 

Asn Glu Asn Lys Phe Asn Glu He Tyr Lys Lys Leu Tyr Ser Phe Thr 
340 34S 350 

Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tvr 
355 360 365 

Phe He Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp 
30 37 5 380 

lie Tyr Thr Val Ser Glu Gly Phe Asn lie Gly Asn Leu Ala Val Asn 
385 390 395 400 

Asn Arg Gly Gin Ser He Lys . Leu Asn Pro Lys He He Asp Ser He 
405 410 415 

Pro Asp Lys Gly Leu Val Glu Lys He Val Lys Phe Cys Lys Ser Vai 
420 425 430 

He Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys He Arg Val 
435 440 445 

Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu 
450 455 460 

Asn Asp He Asn Thr Pro Lys Glu He Asp Asp Thr Thr Asn Leu Asn 
465 470 475 480 

Asn Asn T?r Arg Asn Asn Leu Asp Glu Val He Leu Asp Tyr Asn Ser 
485 490 495 

Gin Thr He Pro Gin lie Ser Asn Arg Thr Leu Asn Thr Leu Val Gin 
500 505 510 

Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu He 
515 520 525 

Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala 
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530 535 

54 0 



■ «. ^ ,„ Mu ^ Glu Ihr A=n n> ^ ^ Thr S « s „ tl . 

" P K «" ? ., >„. S e t s„ I" 

». zr. „„ Lya „,„ s , „. A1 , L>u ^ 

Trp ' " e Hi Ll " Vil "* *■* is Ph < Th * «; ™° «. ^ 

«" TJ, „ i, ^ IU Asp L((u ju ™ ' 

620 ■ 

Oly Leu Ala Leu Asn Tl* r,~ n ~, 

625 630 - IXe IlG G1U Ala ^ H» Asn Phe Glu 

Glu Ala Phe Glu Leu Leu Gly Val Glv II. r „ , 

64 5 Y ly Leu Leu Glu Phe Val Pro 

0 655 

:° lu st v>i «** «• ~ Tto „. Ly , ser Tyr lu 

S 670 

"P.-.TVJ M~ «. -Jj. », „^ y , A1 , .J. A „ 

- „« » u Arg 01u ^ Trp Ly , ^ ^ £ ^ 

"J A5 ° TF "~ & «» - 1° -v= «, i„ „„ 

«» « Tv, C1 „ „. <,„ G1 „ A= „ G1 „ ^ n| ^ ^ 2 

TV. Tyr _ ^ ^ ^ s ^ Asp ^ ^ ™ ^ 

•1- ■« „„ ^ s .„ ^ ne mu ^ ^ ™ ^ ^ 

»; l™ «, „« Lyo a „ clu Arg ph> ^ ■ mu ^ ^ 
»; M« t* ,,„„., ^ „,,„ Asn olu ™ vaj =iy ^ ^ 

. 5 800 

-y. ,y, ryr t , p Hls val ^ fcT ^ ^ 

«P HI. ^ S.r , le L=u Gly 01 „ ^ A<n Mu ^ s ^ ^ S 

°" 830 
Val Thr Ser Thr Leu Asn <^r- n 

835 ^ Ser Hi Ile Phe Glu Leu Ser Ser Tyr 

84 5 

Thr j.„ Asp Lys xi. Leu Ue Xl e Tyr Phe Asn ^ Leu . fyr ^ 
"a Lvs Asp Ser ser Leu Asp ^ t ^ 12 Asn ^ ^ ^ 

»• Asp „. S6r GJy Tyr oly s . r A>n js n _ ^ ^ 

Tyr n. Tyr ? „ Thr A ,„ 4 ,„ As „ ?; „ phe My ii# ^ 
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Leu Ser Glu Val Asn He Ala Gin Asn Asn Asp lie lie Tyr Asn Ser 
915 920 925 

Arg Tyr Gin Asn Phe Ser lie Ser Phe Trp Val Arg lie Pro Lys His 
930 935 940 

Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr He He Asn Cys Met 
945 950 955 9 60 

Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr Val Arq Asn 
965 970 975 

Cys Glu He He Trp Thr Leu Gin Asp Thr Ser Gly Asn Lys Glu Asn 
980 985 990 

Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn Tyr He Asn 
995 1000 1005 

Lys Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly Asn Ser Aro 
i010 1015 1020 

He Tyr He Asn Gly Asn Leu He Val Glu Lvs Ser lie Ser Asn Leu 
102 * 1030 i£)35 1040 

Gly Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Gly Cys 
1045 1050 1055 

Asp Asp Glu Thr Tyr Val Gly He Arg Tvr Phe Lys Val Phe Asn Thr 
1060 1065 * * 1070 

Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn Glu Pro Asp 
. i07S 1080 1085 

Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu Tyr Asn Lys 
1090 1095 1100 

Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr lie Thr Leu 
1105 mo ins 1120 

Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val Thr Glu Gly 
1125 H30 1135 

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val He lie 
1140 H4S 1150 

Arg L.ys Asn Gly Pro lie Asp lie Ser Asn Thr Asp Asn Phe Val Arg 
H55 H60 H65 

Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Gly Val Glu Tyr 
11? 0 1175 H80 

Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He He Arg Thr 
1185 H90 H95 i 200 

Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met Asp Ser He 
1205 1210 1215 

Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly Ser Asn He 
1220 1225 1230 

Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser Ser Trp Tyr 
1235 1240 1245 

Tyr Asn Asn He Arg Arg Asn Thr Ser Ser Asn Gly Cys Phe Trp Ser 
1250 1255 1260 

Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
1265 1270 

(2) INFORMATION FOR SEQ ID NO: 72: 
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(x) SEQUENCE CHARACTERISTICS- 

B T^E TH: i ?«9 r bas « Pairs 
its> TYPE: nucleic acid 

C STRANDEDNESS: double 
' TOPOLOGY: linear , ' 

<ii> MOLECULE TYPE: DMA (genomic) 
<ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 108.. 14 5 X 

SEQUENCE DESCRIPTION : SEQ ID N0 . 7 ,. 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACA 
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAArv » ^GGATAACAA 

1 TTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 



* K . ?* SI »" m s« s? E « SI E - ~ 

g-E SS£ s-s E a-S E Tyr E JffJSE 

— S =? = s s a es s s'ffi s 
s- » si e ss s e e jg e e e e e £ s 

~ " $ - - - ™ « S S JK Si E E E E 
JS E E E E EE S » E - ^ ^ „ T „ .„ 

U « - - a - - E e e e e sit e s 

S £ E Tyr Lyi E E E E E E E J* ** ™ ™ 

120 AS ~[} Ar 9 Glu Tyr Thr Il e H e 

AAT TGT ATG GGG AAT AAT AAT tpp ^ 130 

-..c, « ojv E E E E 5 E E E E E E 
SS E .E E E E E E E IT E ■» «« ^ ~ « 

150 isf ^ LSU Gln As P Thr Ser Gly ^ 

160 

AAG GAA AAT TTA att t-t r.^~ 

^ «„ «. . u E E E E E E E E E E E E 

TAT ATA AAT AAA TGG ATT TTT pta r ^ 

jvj ». ^ «, „: E Sit E E E E E E E E 
- T E ESS? E E E E E E E E E E 



60 
116 

164 

212 
260 
308 

356. 

404 

452 



500 



548 



596 



644 



692 



740 
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(2) INFORMATION FOR SEQ ID NO: 73: 

65 <i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 448 amino acids 
(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



932 



200 205 210 

TCG AAT TTA GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT 788 
Ser Asn Leu Gly Asp lie His Val Ser Asp Asn lie Leu Phe Lys lie 
215 220 225 

GTT GGT TGT GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT 836 
Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lys Val 
230 235 240 

TTT AAT ACG GAA TTA GAT AAA AC A GAA ATT GAG ACT TTA TAT AGT AAT 884 
Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn 
245 250 255 

f> GAG CCA GAT CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA 

Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu 
260 265 270 275 

TAT AAT AAA AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT 98 0 

Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr 
280 285 290 

ATT ACT CTG AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT 1028 
He Thr Leu Asn Ser Gly lie Leu Asn He Asn Gin Gin Arg Gly Val 
295 300 305 

ACT GAA GGC TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA 1076 
. hr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Glv Val Glu 
310 315 320 

GTC ATT ATA AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT AC A GAT AAT 1124 
Val He He Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn 
325 330 335 

TTT GTT AGA AAA AAC GAT CTA GCA TAG ATT AAT GTA GTA GAT CGT GGT 1172 
Phe Val Arg Lys Asn Asp Leu Ala Tyr lie Asn Val Val Asp Arq Glv 
340 345 350 355 

GTA GAA TAT CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA 122 0 

Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lvs Glu Lys He 
360 365 * 370 

ATA AGA ACA TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG 126 8 

He Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met 
3"?5 380 385 ' 

GAT TCA ATA GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG 1316 
Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly 
390 3 95 400 

AGC AAT ATA GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT 1364 
Ser Asn lie Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser 
405 410 415 

55 AGT TGG TAT TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC 14 12 

Ser Trp Tyr Tyr Asn Asn He Arg Arg Asn Thr Ser Ser Asn Gly Cys 
420 425 430 435 

TTT TGG AGT TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGAAAGCTT 14 6 0 

Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu. 
440 445 



70 



(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 73 ' 

~ oiy H1 * m ' Hl i - «- — ~ ~ = ly „ ls 

U. .«M «y B1 , „« S1 . s „ „„ ^ ^ ^ ^ ^ ^ 

tS " ^ % - - - «. - „. e Arg 

T„ =Xu A „ _ Ly , „. Bj ^ ^ ^ ^ ^ 

« "* *~ oly v 'i »* - ~. ^ 

a. ryr ^J ^ W^ „,„ v„ As „.... Ala cln _ J" 

S - »• "-.IT*- - «*..«; P*. s « s „ ph " Trp 

- „, xj. Pro , yo Hls Iyr ^ pr<> (fee ^ - 

«- ... C, „. t » y A ,„ as „ ^ ™ L ^ ^ ^ 

S A1 ' 9 Thr "«p ty oiu ... ,u -r„ Th! ,. 

x ^ . 150 ie *jf| Thr L «u Gin Asp Thr 

. ys „j „„„ „. „„. ^ Ty< ^ ^ »• 

X.. s« — -jvj ... As „ Ly « „ p ph . val mr ^ ^ 

190 

jjy A „„ s „ Ar , ^ ^ n ^ ^ ^ 

Lv * s "* s " - "- fir ~< »• «• v. j.j I" ,« u 

"* Ly ° val irs *- «» vl. c. y ... Ar , Tyr 

- .V. v-i PH. jjj Tnr »„ ^ A , p m> T ^ - 

255 

Tyr Ser Asn Glu Pro Asp Pro Ser ti« r 

260 P SGr Hi ASR T ^ r Tr P Gly Asn 

ry< ta „ jjj. Tyr A ,„ tys Lyo ffi ^ ^ ^ L ^ ^ 

A.P Lj. Tyr ... Thr ^ A s . r „ y „. „.„ Asn ^ 

300 

«| o. y v .i » u „ y s „ val „, # Leu ^ ^ t ^ mu 

v.. „» », „. „. Arg ^ ^ iu ^ ^ 

«p A .„ pj. va, Ar9 , ys A , n ^ Lcu A1< ^ ^ ^ HI ^ 
-P M- 5 „,» „. Tyc Arg T „ ai> ^ ™ 
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Glu Lys He He: Arg thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He 
370 375 380 

He Val Met Asp Ser lie Gly Asn Asn Cys Thr Met Asn, Phe Gin Asn 
385 390 395 I 400 

Asn Asn Gly Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu 
405 410 415 

Val Ala Ser Ser Trp Tyr Tyr Asn Asn He Arg Arg Asn Thr Ser Ser 
420 425 430 

Asn Gly Cys Phe Trp Ser Ser He Ser Lys Glu Asn Gly Trp Lys Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO:74: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = ■• DNA " 

SEQUENCE DESCRIPTION: SEQ ID NO:74: 

CGCCATGGCT ATTCTAATTA TATATTTTAA TAG 

. ( 2) INFORMATION FOR SEQ ID NO:75; 

(iJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

' . ■ ■■ (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA* 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

GCAAGCTTTC ATTCTTTCCA TCCATTCTC 

'.?.) INFORMATION FOR SEQ ID NO: 76: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 94 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..3891 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

ATG CCA GTT AAT ATA AAA AAC TTT AAT TAT AAT GAC CCT ATT AAT AAT 
Met Pro Val Asn He Lys Asn Phe Asn Tyr Asn Asp Pro He Asn Asn 
1 5 10 15 

GAT GAC ATT ATT ATG ATG GAA CCA TTC AAT GAC CCA GGG CCA GGA ACA 
Asp Asp He He Met Met Glu Pro Phe Asn Asp Pro Gly Pro Gly Thr 
20 25 30 

TAT TAT AAA GCT TTT AGG ATT ATA GAT CGT ATT TGG ATA GTA CCA GAA 
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288 
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384 
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15 



40 



50 



60 



65 



CAA ATA TAT AAA AAT AAA TAT GAT TTT GTT GAA GAT CCT AAT GGA AAA 
Gin lie Tyr Lys.Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

TAT AGT GTA GAT AAG GAT AAG TTT GAT AAA TTA TAT AAG GCC TTA ATG 
Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 ' 350 



AGG TAT TCT TAT TTT AGT GAA TAT TTG CCA CCG ATA AAA ACT GAA AAA 
Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro lie Lys Thr Glu Lvs 
370 375 380 



TTG TTA GAC AAT ACA ATT TAT ACT CAA AAT GAA GGC TTT AAC ATA GCT 
Leu Leu Asp Asn Thr lie Tyr Thr Gin Asn Glu Gly Phe Asn He Ala 
385 390 395 400 

AGT AAA AAT CTC AAA ACG GAA TTT AAT GGT CAG AAT AAG GCG GTA AAT 
Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 



CAG TGt ATT ATT GTT AAT AAT GAG GAT TTA TTT TTC ATA GCT AAT AAA 
Gin Cys He He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lvs 
^ 450 455 460 



480 



ACA CAA AAT AAT ACT ATA GAA AAT AAT TTT TCT ATA GAT CAG TTG ATT 
Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser He Asp Gin Leu He 
485 490 495 



560 



ACG AAT TCA TTA AAT GAT GCT TTA AGA AAT AAT AAT AAA GTC TAT ACT 
Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 
565 570 575 

TTT TTT TCT ACA AAC CTT GTT GAA AAA GCT AAT ACA GTT GTA GGT GCT 
Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Giv Ala 
S80 585 590 



1008 



1056 



TTT GGC TTT ACT GAA ACT AAT CTA GCT GGT GAA TAT GGA ATA AAA ACT 1104 
IU Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lvs Thr 

355 360 365 



1152 



1200 



1248 



-> AAA GAG GCT TAT GAA GAA ATC AGC CTA GAA CAT CTC GTT ATA TAT AGA 12 96 

Lys Glu Ala Tyr Glu Glu He Ser Leu Glu His Leu Val He Tyr Arq 
420 425 430 

ATA GCA ATG TGC AAG CCT GTA ATG TAC AAA AAT ACC GGT AAA TCT GAA 1344 
M> He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 

435 440 445 



139: 



GAT AGT TTT TCA AAA GAT TTA GCT AAA GCA GAA ACT ATA GCA TAT AAT 1,4 4 0 

Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr lie Ala Tyr Asn 
465 470 475 



1488 



4? TTA GAT AAT GAT TTA AGC ACT GGC ATA GAC TTA CCA AAT GAA AAC ACA 15 36 

L,eu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 

GAA CCA TTT ACA AAT TTT GAC GAC ATA GAT ATC CCT GTG TAT ATT AAA 158 4 

Glu Pro Phe Thr Asn Phe Asp Asp He Asp He Pro Val Tyr He Lys 
515 520 525 

CAA TCT GCT TTA AAA AAA ATT TTT GTG GAT GGA GAT AGC CTT TTT GAA 16 3 2 

Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 
^ 530 535 540 

TAT TTA CAT GCT CAA ACA TTT CCT TCT AAT ATA GAA AAT CTA CAA CTA 16 8 0 

Tyr Leu His Ala Gin Thr Phe Pro Ser Asn He Glu Asn Leu Gin Leu 
545 550 555 



1728 



1776 



TCA CTT TTT GTA AAC TGG GTA AAA GGA GTA ATA GAT GAT TTT ACA TCT 1824 
/U . Ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
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ATA GAT TCA TCT GGA TAT GGT GCA ACT ATG AAT GTA GGT TCA GAT GTT 
He Asp Ser Ser Gly Tyr GXy Ala Thr Met Asn Val Gly Ser Asp VaT 
885 890 895 

iff ITi ^ ° AT A T A GGA AAT CAA TTT AAA TTA AAT> AAT TCT GAA 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn ser Glu 
900 90S 910 

AAT AGT AAT ATT ACG GCA CAT CAA ACT AAA TTC GTT GTA TAT GAT ACT 
Asn Ser Asn lie Thr Ala His Gin Ser Lys Phe Val Val ™l aJJ Kr 

* L - 3 920 92S 



45 



GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT AAT ATT TTT rry ar-A 
Thi ' Thr ft» «« ^al Trp II. Lys Asp S £n ill pII g?v ArJ 
1060 * 1065 1070 " . 



£h A OS? TTA *** GAT TTT TGG GGG CCT TTA AGA TAC GAT AC A 

55 A 090 r ASP f^ c Trp Gly Asn Pro Le « Arg Tyr AsJ Thr 

1095 1100 

CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT ATC TAT ATA AAG TAT 
Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn He Tyr Vie L^s ICr 
(>0 1110 1115 ' 1120 

TTT AGT AAA GCT TCT ATG GGG GAA ACT GCA CCA CGT ACA AAC TTT AAT 
Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn pAe Asn 
1125 1130 1135 



2688 



2736 



2 784 



283: 



2680 



2928 



2976 



Sir IZT ^ T T 3 T AG ° ATT ^ C ^ TGG GTA AGG ACT CCT AAA TAT 

15 9?0 P ^ ^ "* ^ PhC TrP Val Arg Thr *™ ^ Tyr 

AA T AAT GAT ATA CAA ACT TAT CTT CAA AAT GAG TAT ACA ATA ATT 
Asn Asn Asn Asp He Gin Thr Tyr Leu Gin Asn Glu Tyr Thr lie 

20 5 955 960 

Ser Cvl VT A ?** ** T GAC TCA GGA TGG AAA GTA TCT ATT AAG GGA AAT 
Ser Cys lie Lys Asn Asp Ser Gly Trp Lys Val Ser He Lys Gly Asn 
965 970 975 

25 Aro tT A I 00 ACA TTA ATA GAT GTT AAT GCA AAA TCT AAA TCA ATA 

Arg He Iie Trp Thr Leu He Asp Val. Asn Ala Lys Ser LyV ler tie 
980 965 99 0 

.-JE 5E s: ^ 2sr ?i: ^ ss; ^ *s -s js-js ?s ~* «: »»' 

995 10 °0 1005 

TGG TTT TCC ATA ACT ATT ACT AAT GAT AGA TTA GGT AAC GCA AAT ATT 

35 ^ iofo Ser Ue 116 T^ c ASn ASP A ^ Leu G1 V ^ ^a £Z lie 

1015 1020 

ICr tI A ^1 G ? A AGT AAA AAA AGT GAA AAA ATT TTA AAC TTA GAT 

Tyr lie Asn Gly Ser Leu Lys Lys Ser Glu Lys lie Leu Asn Leu Asp 
40 1030 1035 1040 

AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA TTA ATT AAT TGT ACA 
Arg lie Asn Ser Ser Asn Asp lie Asp Phe Lys Leu III A^n Cvl Thr 
1045 1050 loss 
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316B 



3216 



SO 1 ** T GCT ACA GAA GTA TCT TCA CTA TAT TGG ATT CAA TCA TC- 3 264 

° iU LGU Ala Thr Glu Val Ser Ser Leu Tyr Trp He Gin Ser ler 

1075 1080 10 85 
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1290 1295 

GAA TAA 

Glu 3894 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1297 amino'acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77; 

Tyr Asn Asp Pro He Asn Asn 
10 15 

Vsn Asp Pro Gly Pro Gly Thr 
30 

irg lie Trp He Val Pro Glu 
45 

Sin Phe Asn Ala Ser Thr Glv 
60 y 



7 5 80 
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f 

85 90 95 

Asn Arg lie Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met lie 
100 105 no 

! 

Val Asp Ala lie Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lvs 
115 120 125 

Phe Ala Ala Asn Val Ala Asn Val Ser lie Asn Lys Lys lie He Gin 
130 135 140 

Pro Gly Ala Glu Asp Gin He Lys Gly Leu Met Thr Asn Leu He He 
145 150 155 160 

Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
165 170 175 

Met Asn Gly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

lie Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro " 
-10 215 220 

Ala Leu Thr Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
225 230 235 240 

Gly lie Lys He Ser Asn Leu Pro lie Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

G\Y Gly His Asp Pro Ser Val life Ser Pro Ser Thr Asp Met Asn He 
275 280 285 

Tyr Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala Asn Arg Leu Asn 
290 295 300 

He Val Ser Ser Ala Gin Gly Ser Gly He Asd He Ser Leu Tyr Lys 
305 310 315 320 

Gin He Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lys Thr 
355 360 365 

Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro He Lys Thr Glu Lys 
3*70 375 380 

Leu Leu Asp Asn Thr He Tyr Thr Gin Asn Glu Gly Phe Asn He Ala 
385 390 395 400 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 

Lys' Glu Ala Tyr Glu Glu lie Ser Leu Glu His Leu Val He Tyr Arg 
420 425 430 

He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 

Gin Cys He He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lys 
450 455 460 
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.; m Ly , A , P A1 „ , y . Ala Thr rie m> ^ ^ 

Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser il„ * o n 

485 ;«« er Ile As P Gln Leu Ue 

4 95 

Leu Asp Asn Asp Leu Ser Ser riv » 

500 ° ly As P ^ Asn Glu Asn Thr 

510 

p~ jj. xhr A , n Ph . A „ p i;? Il# ^ ^ ^ ^ ^ 

S.J A1 , L . u Lys L> , „. ^ ^ ^ ^ ^ ^ 

K — «i. .«. ta. »j PK. P„ „ r A „„ , g ^ Lm Mn ^ 

«. A,„ fcr „„„ A , p Al , ^ W ^ ^ ^ ^ ^ ^ 

-h. Ph. S.r TJj As „ val Glu ; , M , Aa|) ^ vh 

590 

- IK V " ^ ~ £ «' ~ iU Asp Phe T „ r s „ 

6.05 

<Uu ser Thr Gln Lys ser ^ r Ile Asp ^ ^ ^ 

lie He Pro Tyr Ile Gly pro ^ ^ ^ ^ ^ AsnGlu ^ r ^ 

L yo aiu Asn Phe Asn Aia phe Giu ^ Q J Giy a ^ a ^ ^ 
Met ciu Phe xi. Pro olu Leu Ile m Pro Ile vai Giy ^ ^ ^ 

^ K AS " Gly Hi. Xle Xle Mec Z Ile 8 „ 

6 85 

Asn Al. Leu Lys Lys Arg Asp Qln Lys Trp ^ ^ ^ ^ ^ ^ 

tie v.l ser Gln Trp Lfu Ser Tnr Val ' Asn ^ ^ phe ^ 

Ly. Glu Ar g MeC Tyr Asn Ala Leu Asn ^ ^ ^ ^ ^ ^ 

Lys ile lie Giu Asp Cln Tyr Asn Arg Tyr Ser Glu Glu ftsp ^ ^ 

750 

Asn xle Asn xle Asp Phe Asn A . g Ile ftsp ^ ^ ^ ^ 

Xle A .„ L eu Aia lie Asn A sn xle Asp Asp Phe xle Zl cm c ys Ser 

lie Ser T V r X.eu Mec Asn Ar 9 Met Xle Pro Leu All Val Lys Lys L eu 

L yo Asp Phe Asp Asp Asn ,.u , ys Ar 9 Asp Z Leu Glu Tyr Ile Zl 

010 815 
Tnr Asn « u ^ Tyr Leu Leu ftsp ^ ^ ^ ^ ^ ^ ^ ^ 

Val Asn Arg His L eu Lys Asp Ser IIe Pro phe Asp ^ ^ ^ ^ 
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835 840 845 

Thr Lys Asp Thr He Leu He Gin Val Phe Asn Asn Tyr lie Ser Asn 
850 855 860 

He Ser Ser Asn Ala He Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu 
865 870 875 ' 880 

He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val 
' 885 890 895 

He Phe Asn Asp lie Gly Asn Gly Gin Phe Lys Leu Asn Asn Ser Glu 
900 905 910 

Asn Ser Asn lie Thr Ala His Gin Ser Lys Phe Val Val Tvr Asp Ser 
915 920 925 

Met Phe Asp Asn Phe Ser He Asn Phe Trp Val Arq Thr Pro Lvs Tvr 
^ () 930 935 940 * 

Asn Asn Asn Asp Tie Gin Thr Tyr Leu Gin Asn Glu Tyr Thr He He 
945 950 955 960 

Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val Ser He Lys Gly Asn 
-> 965 970 975 



30 



Arg He He Trp Thr Leu He Asp Val Asn Ala Lys Ser Lvs Ser He 
980 985 990 

Phe Phe Glu Tyr Ser He Lys Aso Asn He Ser Asp Tvr lie Asn Lys 
995 1000 1005 

Trp Phe Ser He Thr He Thr Asn Asp Arg Leu Gly Asn Ala Asn He 
1010 1015 1020 

Tyr He Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 

iQ 25 1030 1035 1040 

Arg He Asn Ser Ser Asn Asp lie Asp Phe Lys Leu lie Asn Cys Thr 
4U 1045 1050 1055 

Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Gly Arg 
1060 1065 1070 

4> Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tvr Trp He Gin Ser Ser 

1075 1080 1085 



50 



60 



65 



70 



Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro Leu Arg Tyr Asp "Thr 
1090 1095 1100 

Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn He Tyr He Lys Tyr 
±105 mo ins 1120 

Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn Phe Asn 
H25 H30 1135 

Asn Ala Ala He Asn Tyr Gin Asn Leu Tyr Leu Gly Leu Arg Phe He 
1140 1145 1150 

He Lys Lys Ala Ser Asn Ser Arg Asn He Asn Asn Asp Asn He Val 
1155 1160 H65 

Arg Glu Gly Asp Tyr He Tyr Leu Asn He Asp Asn He Ser Asp Glu 
1170 1175 H80 

Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu He Gin Thr Gin 
^85 1190 1195 1200 

Leu Phe Leu Ala Pro He Asn Asp Asp Pro Thr Phe Tyr Asp Val Leu 
1205 1210 * 1215 
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1225 1230 



^ Ss^ Thr «*» Th ' -V ly Leu Phe rifely 
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1260 X 
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* X280 
~rg Leu Gly Cys Asn Trp Gin Phe lie Pro Val Asd g^u n 

1285 .15 on y Tr P Thr 



1290 1295 



Glu 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS - 

<A) LENGTH; i5 35 base pairs 
<B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

■ii) MOLECULE TYPE: DNA (genomic) 

■ixj FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 108 .. 1526 

<:<i> SEQUENCE DESCRIPTION: SEQ ID NO:78: 
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 
TTCCCCTCTA G AAAT AAT TT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 

Met Gly His 

85 ~ - - " «H - S iS £ SKfE S S- 
SS K S S 2? SI 2S S 31 E ™ sr SH SI IS 
XI "I SI S XI JK K S XI S SI S S E S ™ 

ss s? q k s xi s s « - » « » ™ - 
s ss e js sit ?r: - ^ - s ~ « » 2 „ » 
™ xi xi m s; «j «j }jj ?s «» r 2 m M m 

85 9 q 1Ae Tnr Ala His Gin Ser Lys Phe 

si sit i- s; K S S XI XI 51 s » £ - «; -» 

^ s s 5: s xi xi xi xi k £ «? - - =» « 
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4 04 
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45 



65 



GA ~ AT ATA ATT AGT TGT ATA AAA AAT GAC TCA GGA TGG AAA GTA 

Glu Tyr Thr lie n e Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val 
135 140 145 

I5r tTI f^o £? A ^1 AGA A ™ A ™ TGG ACA TTA ATA GAT GTT AAT GCA 

» »i . Asp 

160' 



-_- .... «"i «u« ftW 1W ACA TTA ATA GAT GTT AAT GCA 

Ser He Lys Gly Asn Arg He Ilo Trp Thr Leu He Asp Val Asn Ala 



-" ■ ~, "A AAA GAT TTT TGG GGG AAT CCT 
^ ^£ IIe Gln Ser Ser Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro 

26S 270 275 

TTA AGA TAC GAT ACA CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT 

Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn 

4() 280 285 290 

?T£ I AT ?T A f* 5 I AT AGT **A TCT ATG GGG GAA ACT GCA CCA 

He Tyi He Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro 

295 300 



305 



CGT ACA AAC TTT AAT AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT 
Arg Thr Asn Phe Asn Asn Ala Ala He Asn Tyr Gin Asn Leu Tyr Le^ 



320 



330 335 

AAT GAT AAT ATA GTC AGA GAA GGA GAT TAT ATA TAT CTT AAT ATT GAT 

55 llo P AS " V ^ til G1U Gly ASP ^ Ile ^u A^n 111 2J 

5 3 50 3 55 

AAT ATT TCT GAT GAA TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA 
Asn lie Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val 2sn Ser Lyt 
Ml 360 365 370 

GAA ATT CAA ACT CAA TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG 
Glu He Gin Thr Gin Leu Phe Leu Ala Pro lie Asn Asp Asp Pro Thr 
3 '5 3go 



385 



TTC TAT GAT GTA CTA CAA ATA AAA AAA TAT TAT GAA AAA ACA ACA TAT 
Phe Tyr Asp Val Leu Gin Ile Lys Lys Tyr Tyr Glu Lys Thr Shr Tyr 

390 395 



400 



54B 



596 



AAA TCT AAA TCA ATA TTT TTC GAA TAT AGT ATA AAA GAT AAT ATA TCA 
10 Lys Ser Lys Ser He Phe Phe Glu Tyr Ser lie Lys Asp Asn lie Jer 

-i- 0 - 3 170 175 

GAT TAT ATA AAT AAA TGG TTT TCC ATA ACT ATT ACT AAT GAT AGA TTA 

15 ?80 Yr 16 ASn LyS ?S Phe Sel " Ile Thr Ile Thr Asn Asp ArJ lIu 

190 195 

GGT AAC GCA AAT ATT TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA 
.Gly Asn Ala Asn lie Tyr lie Asn Gly Ser Leu Lys Lys Ser Glu 
2() 200 20S 210 

ATT TTA AAC TTA GAT AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA 
xle Leu Asn Leu Asp Arg lie Asn Ser Ser Asn Asp Ile Asp Phe Lys 
-15 220 225 

25 TZ» tV ^ J- GT A w A GAT ACT ACT *** m GTT TGG ATT AAG GAT TTT 

Leu lie Asn Cys Thr Asp Thr Thr Lys Phe Val Trp lie Lys Asp Phe 

235 240 

M) tT T AGA GAA ** T GCT ACA GAA GTA TCT TCA CTA TAT 

M Asn lie Phe Gly Arg Glu Leu Asa Ala Thr Glu Val Ser Ser Leu Tyr 

A f J £AA TCA TCT ACA AAT ACT TTA AAA GAT TTT TGG GGG AAT CCT 
^t_„ ~, Asp Phe Trp Gly Asn Pro 

270 275 



6 92 



740 



788 



836 



884 



932 



980 



1028 



1076 



GGT TTA CGA TTT ATT ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT li-> 4 
Gly Ar 9 Phe Ile "e Lys Lys Ala Ser Asn Ser Arg £n lie 



1172 



1220 



1268 



1316 



70 rZl rT A f" TG ° GAA *** GAT ACT ACA TTT GGG CTG TTT i364 

Asn Cys Gin lie Leu Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe 
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4 05 410 

4-1 0 415 



15 S & 52 K SS5a«££j*£ i K S 



445 450 



.1412 



1460 



1508 



sr a is ig » « ™ - - & » ™ jtc s £ c „■ 

460 46 5 

GAT GAA GGA TGG ACA GAA TAACTCGAG 

Asp Glu Gly Trp Thr Glu 1535 
470 

(2> INFORMATION FOR SEQ ID NO: 79: 

U) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 473 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79= 
Met Gly His His His His His His His His His His Ser Ser Gly His 

10 15 
He Glu Gly Arg His Met Ala Ser Met Ala Asp Thr Jle Leu ^ ^ 

25 30 
Val Phe Asn Asn Tyr He Ser Asn Jle ser Ser Asn Ala He Leu Ser 

40 45 

Ceu Ser Tyr Arg Gly G1 y Arg Leu He Asp Ser Ser Gly Tyr Gly Ala 

" 60 
Thr Met AS „ val Gly Ser. Asp Val He Phe Asn Asp Ile Gly Asn Gly 

75 80 

Gin Phe Lys Leu Asn Asn Ser Glu Asn Ser Asn He Thr Ala His Gin 

Ser Lys Phe val Val Tyr Asp Ser Met PheAsp Asn Phe Ser Z Asn 

105 110 
Phe Trp Val Arg Thr Pro Lys Tyr Asn Asn Asn Asp lie cm T hr Tyr 

125 

Leu Gin Asn Glu Tyr Thr lie lie Ser Cys lie Lys Asn Asp Ser Gly 

1-Jb 14Q 

Trp Lys val Ser lie Lys Gly Asn Arg lie „. Trp Tnr Leu Ile Asp 

val Asn Ala Lys Ser Lys Ser lie Phe Phe Glu Tyr Ser lie Lys Asp 

170 175 
Asn lie Ser Asp Tyr lie Asn Lys Trp Phe Ser lie Thr lie Thr Asn 

185 190 
Asp Arg Leu Gly Asn Ala Asn lie Tyr lie Asn Gly Ser Leu Lys Lys 

* c * 00 205 

Ser Glu Lys lie Leu Asn Leu Asp Arg lie Asn Ser Ser Asn Asp il e 

215 220 
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Asp Phe Lys Leu He Asn Cys Thr Asp Thr Thr Lys Phe Val Trp He 
225 230 235 . 240 

Lys Asp Phe Asn He Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser 
245 250 255 

Ser Leu Tyr Trp He Gin Ser Ser Thr Asn Thr Leu Lys" Asp Phe Tr D 
260 2 65 270 

Gly Asn Pro Leu Arg Tyr Asp Thr Gin Tyr Tyr Leu Phe Asn Gin Glv 
275 280 285 

Met Gin Asn He Tyr He Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu 
290 295 300 

Thr Ala Pro Arg Thr Asn Phe Asn Asn Ala Ala He Asn Tyr Gin Asn 
305 310 315 320 

Leu Tyr Leu Gly Leu Arg Phe He He Lys Lys Ala Ser Asn Ser Aro 
325 330, 335 

Asn He Asn Asn Asp Asn lie Val Arg Glu Gly Asp Tyr He Tyr Leu 
340 345 350 

Asn He Asp Asn He Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val 
355 360 365 

Asn Ser Lys Glu He Gin Thr Gin Leu Phe Leu Ala Pro lie Asn Asd 
370 375 380 

Asp Pro Thr Phe Tyr Asp Val Leu Gin He Lys Lvs Tyr Tyr Glu Lvs 
385 390 395 * 400 

Thr Thr Tyr Asn Cys Gin He Leu Cys Glu Lys Asp Thr Lys Thr Phe 
405 410 415 

Gly Leu Phe Gly He Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp 
420 425 430 

Asp Thr Tyr Asp Asn Tyr Phe Cys He Ser Gin Trp Tyr Leu Arg Arq 
435 440 445 

He Ser Glu Asn He Asn Lys Leu Arg Leu Gly Cys Asn Trp Gin Phe 
45 0 455 460 

He Pro Val Asp Glu Gly Trp Thr Glu 
465 470 

(2) INFORMATION FOR SEQ ID NO: 80: 

( i i SEQUENCE CHARACTERISTICS : 
(A> LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:80: 

CGCCATGGCT GACACAATTT TAATACAAGT 

(2) INFORMATION FOR SEQ ID NO: 81: 

<i> SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : other nucleic a HH ' 
(A) DESCRIPTION s /desc= «DNA« 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GCCTCGAGTT ATTCTGTCCA TCCTTCATCC AC I 
(2) INFORMATION FOR SEQ ID NO: 82: 

<±> SEQUENCE CHARACTERISTICS * 

(A) LENGTH; 12 amino acids 

(B) TYPE: amino acid 

m? ^^ NDEDNESS: not relevant 
■(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

<ix) FEATURE: 

(A) NAME /KEY: Modif ied-si te 

(B) LOCATION: 12 

(D) OTHER INFORMATION- /nnr« 

chx S position contains an group.- The ■«P*« 9 i M residue at 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:82: 
CVS ui» T hr He Asp Cly Lys Lys Tyr Tyr ^ 

10 
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CLAIMS 

I . A host cell containing a recombinant expression vector, said vector encoding a 
protein comprising at least a portion o( a Clostridium boiulinum toxin, said toxin selected 
5 from the group consisting of type B toxin and type E toxin. 

2. The host ceil of Claim 1 . wherein and said host cell is capable of expressing 
said protein at a level greater than or equal to 5% of the total cellular protein. 



10 



15 



20 



25 



30 



3. The host cell of Claim I, wherein and said host cell is capable of expressing 
said protein as a soluble protein at a level greater than or equal to 0.25% of the total soluble 
cellular protein. 

4: The host ceil of Claim 1 . wherein said host cell is an Escherichia caii cell. 

5. The host cell of Claim 1: wherein said host cell is an insect cell. 

6. The host cell of Claim I. wherein said host cell is a yeast cell. 

7. A host cell containing a recombinant expression vector, said vector encoding a 
fusion protein comprising a non-toxin protein sequence and at least a portion oi l a Clostridium 
hoiulimim toxin, said toxin selected from the group consisting of type B toxin and type F- 
toxin. 

8. The host cell of Claim 7. wherein said portion of said toxin comprises the 
receptor binding domain. 

9. The host cell of Claim 7, wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

10. A vaccine comprising a fusion protein, said fusion protein comprising a non- 
toxin protein sequence and at least a portion of a Clostridium hoi ul mum toxin, said toxin 
selected from the group consisting of type B toxin and type E toxin. 
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^ h The y^ of Claim ,0 ^ comprising a fusion p^n comprisi^ 8 „„, 
loxm protem sequence and at .cast a portion of dos^un, Hoiulinurn type A toxin. 

12. The vaccine of Claim ,0. wherein said portion of said Clos^iun, Hotulinun, 
toxm comprises the receptor binding domain. 

13^ The vaccine of Claim ,0 wherein said non-toxin protein sequence comprises a 
poly-histidtne tract. "»«pnses a 



«4. The vaccine of Caim .0. wherein said vaccine is substantially endotoxin-free. 

15. A method of generating antibody directed against a Uos^iun, ^ulinun, 
toxin comprising: 

a) providing in any order: 

0 an antigen comprising a fusion protein comprising a noh-toxin 
protein sequence and at leas, a portion of a C,o Mi „ m holulinum toxin ^ 
«oxm selected from the group consisting of type B toxin and type £■ toxi , and 

") a host; and 

b) immunizing said hos, with said antigen so as to generate an antibody. 

16. The method of Claim ,5. wherein said antigen further comprises a fusion 
nrotem comprising a non-toxin protein sequence and at least a portion of C Mium 
hoiuimum type A toxin. 

17. The method of Chum 15. wherein said portion of said Clostridiun, borulinum 
toxm comprises the receptor binding domain. 

18. The method of Claim 15 wherein dirt . 

w ^rem said non-toxin protein sequence comprises a 

poly-histidine tract. 



1 9. The method of Claim 1 5 wherein said host is a mammal. 

20. The method of Claim 19 wherein said mammal is a human. 
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21. The method of Claim 15 further comprising step c) collecting said antibodies 
from said host 

22. The method of Claim 21 further comprising step d) purifying said antibodies. 

23. The antibody raised according to the method of Claim 15. 

24. The antibody raised according to the method of Claim 16. 
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FIGURE 3 
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FIGURE 4 
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FIGURE 5 
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FIGURE 6 




Fragment 1 Fragment 3 



EcoRJ PstI 
_1 f 



P3 Fragment Z 



P1-P4 *re PCR primers 1-4. PI =5 , GGAAATTTAGCTGCAGCATCTGAC3 , I 
PZ=STCTAGCAAATTCG(nTGTOTTGAA3\P3=5 , CT 

P4=5 f CTATCTAGGCCrAAAGTAT3 < . Indicated restriction sites in fragments 1 and 2 are internal 
sites used to done into pGEX2T vector (fragment 1 ; construct called pGA3O-660) or pMALc 
vector (fragment 2; construct called pMA660-l 100). Bracketed restriction sites at ends of 
fragment 3 are pUC9 polytinker sites utilized to done fragment 3 into pET23 vector 
(construct called pPAl 1 00-2680). Numbers in these constructs refer to toxin A amino add 
interval that is expressed The shaded portion of the toxin A gene corresponds to the repeating 
itgand binding domain. 
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FIGURE 8 
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pP refers to pET23 vector, pM refers to pMALc vector. A refers to toxin A, and numbers refer to 
amino acid interval expressed in done. Endpoints of clones correspond to indicated restriction sites 
shown of toxin A map. 
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FIGURE 9 
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FIGURE 10 
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FIGURE 13 
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FIGURE 16 
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FIGURE 18 
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FIGURE 19 
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FIGURE 22 
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FIGURE 23 
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1 2 3 4 5 6 7 




WO 93/08540 i 

1 PCT/US97/15394 



FIGURE 27 
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FIGURE 28 
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FIGURE 29 
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FIGURE 30 
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FIGURE 31 
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FIGURE 32 
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FIGURE 33 



1 2 3456 78 




, ■ . • • ■ • . I 

WO 98/08540 PCT/US97/15394 



FIGURE 34 
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FIGURE 35 
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FIGURE 36 
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FIGURE 37 
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FIGURE 38 
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FIGURE 39 
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FIGURE 40 
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